Applied AI Engineer

Permanent
Technology
Senior
AI, LLM, Agent Development
San Francisco, California
California
US$180000 - US$250000 per annum + Equity

Tired of brittle, unpredictable LLM agents? Come help build the IDE that makes them actually work.

I'm partnering with one of the most exciting early-stage startups in the AI tooling space - a YC-backed company that's on a mission to bring structure (and joy) to human-AI collaboration.

They're building an IDE that lets anyone design, test, and deploy sophisticated AI agents using natural language - not code. Imagine the Notion of AI systems, empowering the next billion knowledge workers to create with AI.

They're now hiring an Applied AI Engineer to help architect and scale the core agent infrastructure - from memory and evaluation to real-world reliability.


🚀 What you'll build:

  • Multi-step, tool-using agents that call real APIs, manage auth, retries, timeouts, and all the tricky edge cases.

  • RAG pipelines that turn messy data into grounded, useful answers.

  • Memory systems that persist context - scratchpads, summary buffers, embedding stores.

  • Deterministic execution and replay tools so users can trace exactly how an agent thinks.

  • A robust eval framework blending automated checks with human-in-the-loop scoring.

  • Plus whatever greenfield ideas you want to bring to life.


🧑‍💻 Who they're looking for:

  • 3+ years of engineering experience shipping production software. You've built agent-like systems: multi-step LLM workflows, tool-using bots, or scripted assistants.

  • Hands-on with:

    • RAG (embeddings, vector DBs, chunking)

    • Agent memory (scratchpads, history compression, summaries)

    • Orchestrating real tools + APIs (auth flows, plugins)

    • Evaluation - defining success metrics, running regression tests, iterating on agent behavior

  • Obsessed with fast response times, predictable outputs, traceability, and uptime. This is production, not research.

  • Thrive in fast-moving, product-first teams that bias for shipping.

Bonus points for:

  • Experience with (or strong opinions about) LangChain, CrewAI, DSPy.

  • Shipped agents used by actual customers - beyond internal demos.

  • Deep familiarity with LLM ops, tracing, observability.

  • Been a founder or early engineer who sweats the details of product quality.


🏢 The details:

  • Full-time, in-person role in San Francisco (Presidio) - 5 days a week.

  • Must have US work authorization (open to O-1 visas for exceptional folks).

  • You'll do the best work of your life alongside genuinely sharp, friendly people.


🎯 Why it matters:
Most LLM agents break in the wild. Here, you'll help build the platform that ensures they don't - enabling the world to create smarter AI systems without ever writing a line of code.

Similar Jobs

US$150000 - US$200000 per annum
New York
Data Scientist We're looking for a Data Scientist to help establish the quantitative foundation of a cutting-edge trust and validation framework for autonomous systems. In this role, you'll design rigorous statistical methodologies to evaluate system performance, develop confidence and reliability metrics, and support high-scale deployment with robust measurement systems.
€80000 - €120000 per annum + 120000
Berlin
Founding AI Engineer - Berlin (German fluency required) €100k ± 20% | equity | Full-time, on-site We're building the AI automation layer for the construction industry - live at major enterprises and already saving teams up to 90% of back-office time. Join us in Berlin as our Founding AI Engineer to shape and scale high-impact, real-world GenAI use cases from day one.
US$180000 - US$250000 per annum
New York
Data Engineer We're looking for an experienced Data Engineer to take ownership of our data platform and help scale it to meet growing demands. You'll be responsible for maintaining and optimizing data pipelines, designing new ingestion processes, and supporting the data needs of various teams including product, data science, and go-to-market.
£40000 - £60000 per annum
Norfolk
Are you a skilled Full Stack Developer with a passion for clean code, smart systems, and meaningful tech? We're working with an innovative environmental monitoring business that's on a mission to reshape how we interact with scientific data - and they need their first dedicated developer to help lead that charge. This is a unique opportunity to join a close-knit team as they rebuild their data-driven platform from the ground up.
£60000 - £70000 per annum
Cheshire
Requirements: * 5+ years of front-end experience * Expert in React and TypeScript * Hands-on AWS integration experience * Strong understanding of APIs (REST/GraphQL) * CI/CD familiarity and clean coding practices
£35000 - £40000 per annum
London
Company TEC Partners are representing one of the UK's fastest growing competition companies. They have amassed a large social media following having given out over £100 million in luxury prizes, including cars, tech and even houses! With a 4.5 Trustpilot rating and a loyal and continually expanding customer base, they have positioned themselves as one of the hottest companies to work for.
£50000 - £55000 per annum
Cheshire
A fast-growing, technology-led organisation is seeking a Full Stack Java Developer to join its collaborative engineering team. This role is well-suited to someone who's not just looking for a coding job, but who wants to get stuck in - someone with a clear desire to learn, take on ownership, and make a real contribution to the wider business and its platforms.
US$180000 - US$250000 per annum + Equity
California
I'm working with a YC-backed startup building an IDE for creating robust AI agents in natural language, think Notion for LLM systems. They need an Applied AI Engineer to design multi-step agents, RAG pipelines, memory, and evals. Full-time, in person in SF. Game-changing work at the edge of AI reliability.
£125000 - £150000 per annum + + double OTE
New York
Series A devtools startup ($60M raised) is hiring Enterprise AEs to sell a high-ACV, AI-powered platform that helps engineers rapidly build internal apps-selling into technical stakeholders (CTOs, VPs Eng, etc).
US$120000 - US$180000 per annum + Equity
California
Join a Y Combinator-backed AI startup as a Founding ML Engineer to build tools that test and monitor LLM-powered voice agents. You'll develop core infrastructure, real-time evaluation pipelines, and user-facing features. Ideal for engineers with strong Python, ML, and LLM experience who want to shape critical AI reliability tools.
US$125000 - US$150000 per annum + + double OTE
Texas
We're hiring a Founding Account Executive to lead GTM at a stealth AI startup making internal data conversational through LLMs. Work directly with a proven, multi-time founder to shape the sales motion from day one.
£350 - £450 per day
West Midlands
Job Title: Network Support Engineer Location: Birmingham - Hybrid Job Type: Contract (Inside IR35) - 9-12 months Rate: £350-450 p/day