Founding AI Engineer

Permanent
Technology
Experienced
AI, LLM, Agent Development
San Francisco, California
California
US$120000 - US$180000 per annum + Equity

Tired of LLMs breaking in the wild? Come build the platform that prevents it.

One of the most exciting early-stage startups in the AI infrastructure space is hiring a Founding Machine Learning Engineer to help shape the future of LLM observability, testing, and evaluation.

They're backed by Y Combinator and founded by IIT Bombay alumni with experience at ETH Zurich and top-tier quant trading firms. The mission? To make sure LLM-powered voice agents actually work - before they go live.

Their platform automatically simulates thousands of real-world conversations - from ordering food to handling job interviews - to stress-test agents with scale and depth. Think load testing meets GPT, with full evaluation, benchmarking, and monitoring.

🛠 What you'll build:

  • AI tools to test, evaluate and benchmark large language models (LLMs)
  • Scalable pipelines for real-time agent monitoring and performance feedback
  • Core infrastructure for LLM agent reliability
  • Customer-facing features, working directly with users and founders

🙋‍♂️ Who they're looking for:

  • Strong Python and ML engineering experience
  • Hands-on background in LLM product development or deployment
  • Interest in agent infrastructure, evaluation frameworks, or LLM testing
  • Bonus if you've worked in early-stage startups or on AI tooling

💡 This is your chance to be the technical co-founder of a product every AI team will need building at the edge of what's possible in AI reliability.

If you (or someone you rate highly) is excited by the intersection of AI agents, LLM infrastructure, and startup ownership - drop me a message. Happy to share more.

Similar Jobs

£35000 - £40000 per annum
London
Company TEC Partners are representing one of the UK's fastest growing competition companies. They have amassed a large social media following having given out over £100 million in luxury prizes, including cars, tech and even houses! With a 4.5 Trustpilot rating and a loyal and continually expanding customer base, they have positioned themselves as one of the hottest companies to work for.
£50000 - £55000 per annum
Cheshire
A fast-growing, technology-led organisation is seeking a Full Stack Java Developer to join its collaborative engineering team. This role is well-suited to someone who's not just looking for a coding job, but who wants to get stuck in - someone with a clear desire to learn, take on ownership, and make a real contribution to the wider business and its platforms.
US$180000 - US$250000 per annum + Equity
California
I'm working with a YC-backed startup building an IDE for creating robust AI agents in natural language, think Notion for LLM systems. They need an Applied AI Engineer to design multi-step agents, RAG pipelines, memory, and evals. Full-time, in person in SF. Game-changing work at the edge of AI reliability.
£125000 - £150000 per annum + + double OTE
New York
Series A devtools startup ($60M raised) is hiring Enterprise AEs to sell a high-ACV, AI-powered platform that helps engineers rapidly build internal apps-selling into technical stakeholders (CTOs, VPs Eng, etc).
US$120000 - US$180000 per annum + Equity
California
Join a Y Combinator-backed AI startup as a Founding ML Engineer to build tools that test and monitor LLM-powered voice agents. You'll develop core infrastructure, real-time evaluation pipelines, and user-facing features. Ideal for engineers with strong Python, ML, and LLM experience who want to shape critical AI reliability tools.
US$125000 - US$150000 per annum + + double OTE
Texas
We're hiring a Founding Account Executive to lead GTM at a stealth AI startup making internal data conversational through LLMs. Work directly with a proven, multi-time founder to shape the sales motion from day one.
£350 - £450 per day
West Midlands
Job Title: Network Support Engineer Location: Birmingham - Hybrid Job Type: Contract (Inside IR35) - 9-12 months Rate: £350-450 p/day
£800 - £900 per day
Berkshire
We are working on behalf of a leading organisation seeking an experienced Workday Extend & Integration Specialist to take a hands-on role in the design, build, and delivery of bespoke Workday applications. This is a fantastic opportunity to work across multiple business areas, applying modern integration techniques and enhancing enterprise systems through intelligent automation and secure, scalable design.
£40000 - £60000 per annum
Norfolk
This is a unique opportunity to join a close-knit team as they rebuild their data-driven platform from the ground up. You'll be responsible for improving the system that ingests and processes large volumes of real-time data, presents it in a user-friendly interface, and powers external integrations via robust APIs. Tech stack includes Laravel (PHP) and Vue.js - so proven experience in both is essential.
£100000 - £110000 per annum + 10% bonus
London
Key Responsibilities: Collaborate with stakeholders, engineers, and product managers to define architectural solutions. Design scalable, secure backend systems using microservices and Google Cloud. Ensure seamless integration of APIs and backend services into Unity-based game environments. Provide architectural oversight on AI enablement and automation initiatives. Create and maintain architecture diagrams, API specs (OpenAPI 3.0), and data flows.
£77000 - £116000 per annum
London
Job Title: Elasticsearch Platform Engineer Location: London - full time on-site Clearance Requirement: UK-Highest Level of Government Clearance Salary Range: £77,000 - £116,000 + Benefits
Job Title: NOC Architect / NOC SME - Network Operations Centre Location: Remote Job Type: Contract (Inside IR35) - 3 months initially w/ likely extensions Rate: £700-750 p/day