Senior AI Research Engineer, Model Inference (100% Remote)
Company: Tether Operations Limited
City: Astana
Salary: not specified
Posted: 2025-10-02
Description
Join Tether and Shape the Future of Digital Finance
At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.
Innovate with Tether
Tether Finance: Our innovative product suite features the world’s most trusted stablecoin, USDT, relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.
But that’s just the beginning:
Tether Power: Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.
Tether Data: Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET, our flagship app that redefines secure and private data sharing.
Tether Education: Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.
Tether Evolution: At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.
Why Join Us?
Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry.
If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.
Are you ready to be part of the future?
About the job:
We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for language models, with a strong focus on mobile and integrated GPU acceleration (Vulkan).
This role requires hands-on experience with quantization techniques, LoRA architectures, the Vulkan backend, and mobile GPU debugging. You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLMs/LLMs.
Responsibilities:
Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.
Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.
Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).
Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.
Investigate and resolve GPU acceleration issues on Vulkan and integrated/mobile GPUs.
Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.
Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).
Integrate and validate quantization workflows for training and inference.
Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).
Conduct GPU testing across desktop and mobile devices.
Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.
Deliver production-grade, efficient language model deployment for mobile and edge use cases.
Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications. Define clear success metrics, such as improved real-world performance, low error rates, robust scalability, and optimal memory usage, and ensure continuous monitoring and iterative refinement for sustained improvement.
Requirements:
Proficiency in C++ and GPU kernel programming.
Proven expertise in GPU acceleration with the Vulkan framework.
Strong background in quantization and mixed-precision model optimization.
Experience and expertise in Vulkan compute shader development and customization.
Familiarity with LoRA fine-tuning and parameter-efficient training methods.
Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.
Hands-on experience with mobile GPU acceleration and model inference.
Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon).
Experience implementing custom backward operators for fine-tuning.
Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.
Demonstrated ability to apply empirical research to overcome challenges in model
Important information for candidates
Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles:
Apply only through our official channels. We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page: https://tether.recruitee.com/
Verify the recruiter’s identity. All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.
Be cautious of unusual communication methods. We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms.
Double-check email addresses. All communication from us will come from email addresses ending in @tether.to or @tether.io.
We will never request payment or financial details. If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately.
When in doubt, feel free to reach out through our official website.