Rigor
We hold ourselves to the highest scientific standards. Every claim is backed by evidence, every benchmark is reproducible.
We're growing fast — and actively hiring across research, engineering, design, and community. Join us at the ground floor and help shape how intelligence learns through practice.
We're a small, focused team with outsized ambition — and we're growing. Our research is published at top venues and our tools are used by the community worldwide. If you want to work on frontier problems, ship things that matter, and do it alongside people who care deeply about getting it right, this is the place.
We hold ourselves to the highest scientific standards. Every claim is backed by evidence, every benchmark is reproducible.
We follow questions wherever they lead. The best research comes from genuine wonder about how intelligence works.
Like a musician perfecting a passage, we care deeply about the quality and elegance of our work, from code to papers.
Our research, tools, and data are shared freely. We believe open science accelerates progress for everyone.
Find Your Part
Every voice in the ensemble matters. We're looking for people who bring both precision and passion.
6 roles available — actively interviewingLead original research into practice-based learning and deliberate practice for AI agents. You'll design experiments that probe how agents improve through structured repetition and self-assessment, publish findings at top venues, and help define our core research agenda.
Requirements: PhD or equivalent research experience in ML/AI; publications at NeurIPS, ICML, ICLR, or ACL; deep interest in agent learning, evaluation methodology, and skill acquisition; strong Python and experiment-design fundamentals.
Build the infrastructure that makes agent training loops possible. You'll design feedback pipelines, experiment orchestration systems, and the scaffolding that lets agents practice, fail, and improve at scale — the engineering backbone of our core research.
Requirements: Strong software engineering fundamentals; hands-on experience building or evaluating LLM-based agents; proficiency in Python and modern ML frameworks; comfort shipping production-quality systems alongside research prototypes.
Own the evaluation infrastructure and developer-facing APIs that the community depends on. You'll build scalable systems for large-scale model evaluation, design reliable data pipelines, and ship the open-source tooling that makes rigorous AI benchmarking accessible to everyone.
Requirements: 5+ years in backend or platform engineering; experience with distributed systems and cloud infrastructure (AWS/GCP); track record of building and maintaining well-documented open-source projects; Go, Rust, or Python at production scale.
Take practice-based agents from research into enterprise deployments. You'll work directly with customers to understand their workflows, adapt our agent systems to real-world constraints, and close the loop between production feedback and the research team.
Requirements: Experience deploying ML systems to production; familiarity with LLM fine-tuning, RLHF, or agent frameworks; strong Python engineering skills; comfort navigating ambiguous customer requirements and translating them into technical specs.
Shape how people interact with AI agents. You'll design interfaces for agent monitoring, feedback collection, and practice-loop visualization — making complex agent behaviour legible and actionable for researchers and end users alike.
Requirements: 3+ years of product or interaction design; experience designing developer tools or data-heavy interfaces; strong Figma skills; genuine curiosity about AI agent behaviour and how to surface it clearly; bonus: experience with ML or research tooling.
Be the voice of Etude AI in the open-source community. You'll write the documentation, tutorials, and blog posts that help developers understand and use our tools; engage with community contributors on GitHub; and help us tell the story of what we're building and why it matters.
Requirements: Proven track record of technical writing for developer audiences; comfort reading and writing Python; experience growing open-source communities or developer advocacy programs; strong editorial instincts and the ability to explain hard ideas clearly.
The Ensemble's Benefits
We're building something rare: a small team doing frontier research with real resources, real equity, and real freedom. Here's what that looks like in practice.
Work on problems that matter and publish at top venues. We actively encourage — and budget for — publications at NeurIPS, ICML, ICLR, ACL, and CVPR.
Your contributions are visible and consequential from day one. No layers of approvals, no competing roadmaps — just focused work on hard problems with people who notice the details.
We offer meaningful equity packages at an early stage — with upside that reflects the ground-floor opportunity. Competitive salary, retirement matching, and education stipend included.
Run the experiments you need to run. We provide access to the compute resources that make serious research possible, without bureaucratic approval chains slowing you down.
Comprehensive medical, dental, and vision coverage. Mental health support. 16 weeks paid parental leave. Fertility benefits. Flexible PTO — including our protected Open Source Fridays.
Work from wherever you do your best thinking. Home office budget, remote-first culture, and a team distributed across North America. Annual retreats to bring everyone together in person.
A Culture of Practice
We practice what we preach. Just as our agents learn through deliberate repetition, our team grows through shared inquiry and weekly rituals that keep everyone sharp.
Every week the team gathers to present work-in-progress, share recent papers, or dig into an open problem. Debate is encouraged; hand-waving is not.
Rotating reading groups cover the latest from NeurIPS, ICML, ICLR, and beyond. We read critically, not just to stay current but to understand what questions aren't being asked yet.
Friday afternoons are protected time for open-source contributions, tooling improvements, and community work. No meetings, no sprints — just building things you're proud to share.
We sponsor attendance at the top venues — NeurIPS, ICML, ICLR, ACL, CVPR. Presenting or attending, the budget is there. Staying connected to the broader research community is part of the job.
The Path Forward
We keep things transparent and move quickly. Here's what to expect.
We review your background and research interests.
45-min video call with a team member.
Present your work or solve a research problem.
Meet potential collaborators across teams.
We move fast. Expect a decision within a week.
We're growing quickly and new roles open often. If you share our passion for rigorous AI research, deliberate practice, and open science — we want to hear from you now, before the next role is posted.
Get in touch