AI Research Engineer (Multi-Modal & Vision) - 100% Remote Worldwide
Join Tether and Shape the Future of Digital Finance
At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.
Innovate with Tether
Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.
But that’s just the beginning:
Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.
Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.
Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.
Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.
Why Join Us?
Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry.
If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.
Are you ready to be part of the future?
About the job
As a member of the AI model team, you will drive innovation in training and optimizing vision-language models with a focus on real-world deployment. Your work will span the full model development lifecycle - from data curation and training pipeline design to model evaluation and optimization - with the goal of building models that are both highly capable and practical to deploy at scale.
You will work across a wide spectrum of multimodal architectures integrating text and vision, applying state-of-the-art research to improve model quality, efficiency, and domain-specific performance. We expect you to bring a research-driven mindset combined with strong engineering discipline - someone who can identify the right technique for a given problem, implement it rigorously, and measure its impact clearly.
You will work closely with a small, high-caliber team where your contributions will have direct and meaningful impact. If you are passionate about pushing the boundaries of what multimodal AI can achieve in production environments, this is your opportunity.
Responsibilities
Conduct end-to-end research and engineering on vision-language models, covering training, evaluation, and optimization across the full model development lifecycle.
Design and implement post-training pipelines including supervised fine-tuning, knowledge distillation, and reinforcement learning from human feedback.
Develop and maintain high-quality multimodal datasets, including data curation, filtering, and balancing for domain-specific tasks.
Drive model efficiency and deployability, adapting models for resource-constrained environments using compression and optimization techniques.
Design and implement evaluation frameworks and benchmarks to measure model performance, robustness, and real-world task success.
Build and scale training workflows across distributed GPU infrastructure.
Identify and resolve bottlenecks in training pipelines to achieve state-of-the-art model quality on target benchmarks.
Contribute to and leverage open-source ecosystems including models, datasets, and tooling to accelerate development.
Stay current with the latest research in multimodal learning and vision-language systems, translating relevant findings into practical improvements.
Publish research findings in top-tier AI conferences and journals where applicable.