Applied AI Engineer

Permanent
Technology
Senior
AI, LLM, Agent Development
San Francisco, California
California
US$180000 - US$250000 per annum + Equity

Tired of brittle, unpredictable LLM agents? Come help build the IDE that makes them actually work.

I'm partnering with one of the most exciting early-stage startups in the AI tooling space - a YC-backed company that's on a mission to bring structure (and joy) to human-AI collaboration.

They're building an IDE that lets anyone design, test, and deploy sophisticated AI agents using natural language - not code. Imagine the Notion of AI systems, empowering the next billion knowledge workers to create with AI.

They're now hiring an Applied AI Engineer to help architect and scale the core agent infrastructure - from memory and evaluation to real-world reliability.


🚀 What you'll build:

  • Multi-step, tool-using agents that call real APIs, manage auth, retries, timeouts, and all the tricky edge cases.

  • RAG pipelines that turn messy data into grounded, useful answers.

  • Memory systems that persist context - scratchpads, summary buffers, embedding stores.

  • Deterministic execution and replay tools so users can trace exactly how an agent thinks.

  • A robust eval framework blending automated checks with human-in-the-loop scoring.

  • Plus whatever greenfield ideas you want to bring to life.


🧑‍💻 Who they're looking for:

  • 3+ years of engineering experience shipping production software. You've built agent-like systems: multi-step LLM workflows, tool-using bots, or scripted assistants.

  • Hands-on with:

    • RAG (embeddings, vector DBs, chunking)

    • Agent memory (scratchpads, history compression, summaries)

    • Orchestrating real tools + APIs (auth flows, plugins)

    • Evaluation - defining success metrics, running regression tests, iterating on agent behavior

  • Obsessed with fast response times, predictable outputs, traceability, and uptime. This is production, not research.

  • Thrive in fast-moving, product-first teams that bias for shipping.

Bonus points for:

  • Experience with (or strong opinions about) LangChain, CrewAI, DSPy.

  • Shipped agents used by actual customers - beyond internal demos.

  • Deep familiarity with LLM ops, tracing, observability.

  • Experience as a founder or early engineer who sweats the details of product quality.


🏢 The details:

  • Full-time, in-person role in San Francisco (Presidio) - 5 days a week.

  • Must have US work authorization (open to O-1 visas for exceptional folks).

  • You'll do the best work of your life alongside genuinely sharp, friendly people.


🎯 Why it matters:
Most LLM agents break in the wild. Here, you'll help build the platform that ensures they don't - enabling the world to create smarter AI systems without ever writing a line of code.
