Jeeves logo
Jeeves

Senior AI Engineer

🕐 29 dias atrás📍 Mexico🌍 Remoto

What You'll Do:

LLM & AI Pipeline Engineering

  • Design, build, and maintain production-grade LLM integration pipelines — including retrieval-augmented generation (RAG), prompt engineering, output parsing, and chain orchestration.
  • Develop and operate AI features within Jeeves's core financial products: spend categorization, document extraction, anomaly detection, financial Q&A, and automated reconciliation.
  • Implement structured output validation, fallback handling, and confidence scoring to ensure AI decisions meet reliability standards for financial use cases.
  • Evaluate and integrate AI frameworks and tools (LangChain, LlamaIndex, OpenAI API, Anthropic API, HuggingFace, vector databases) and advocate for the right tool for the job.
  • Establish prompt versioning and evaluation practices to ensure AI outputs remain accurate and consistent as models and data evolve.

Retrieval & Vector Search

  • Design and maintain vector search pipelines using databases such as Pinecone, Weaviate, or pgvector to power semantic search and RAG-based features.
  • Build document ingestion and chunking pipelines for Jeeves's financial data — processing invoices, receipts, policy documents, and transaction records.
  • Optimize retrieval quality through embedding model selection, chunk strategy, metadata filtering, and re-ranking techniques.

ML Model Serving & Operations

  • Collaborate with data scientists to take trained ML models from experimental notebooks to production serving infrastructure.
  • Build and maintain model serving endpoints with appropriate latency SLOs, input validation, and output monitoring.
  • Implement model performance monitoring and data drift detection to ensure production models remain accurate over time.
  • Support model retraining workflows by designing clean data pipelines and feature engineering that can be continuously updated.

Backend Integration & Reliability

  • Integrate AI services cleanly with Jeeves's backend microservices — designing clear API contracts, circuit breakers, and graceful degradation patterns.
  • Write high-quality, testable backend code in Python or Go/Node.js to power AI-integrated features.
  • Instrument AI components with structured logging, distributed tracing, latency dashboards, and alerting to ensure operational visibility.
  • Build human-in-the-loop review workflows for AI decisions that require oversight — particularly for high-value financial actions.

Collaboration & Growth

  • Partner with Product, Backend Engineering, and Data Science to define the AI roadmap and translate requirements into reliable systems.
  • Contribute to a culture of quality by writing design docs, reviewing peers' AI system designs, and sharing learnings openly.
  • Help grow the AI engineering practice at Jeeves by establishing patterns, tooling, and best practices that the broader team can build on.

Requirements:

Minimum Requirements

  • Bachelor's degree in Computer Science, Engineering, or a related field — or equivalent practical experience.
  • 5+ years of professional software engineering experience, with at least 3 years focused on AI/ML systems in production.
  • Hands-on experience building and deploying LLM-powered applications using APIs such as OpenAI, Anthropic, or Cohere in a production environment.
  • Experience designing and operating RAG pipelines, including chunking strategies, embedding models, and vector database integration (Pinecone, Weaviate, pgvector, or similar).
  • Strong proficiency in Python for AI/ML workloads; familiarity with at least one AI orchestration framework (LangChain, LlamaIndex, or equivalent).
  • Experience with ML model serving infrastructure: REST or gRPC inference endpoints, input/output validation, latency budgeting, and monitoring.
  • Solid backend engineering fundamentals: REST APIs, relational databases (PostgreSQL preferred), async patterns, and cloud infrastructure (AWS, GCP, or Azure).
  • Experience with observability tooling: structured logging, distributed tracing, and building dashboards for AI system health.

Preferred Qualifications

  • Experience in fintech, financial services, or any regulated industry where AI reliability and auditability are critical.
  • Familiarity with prompt evaluation frameworks, A/B testing AI outputs, and tracking model performance degradation in production.
  • Experience with ML lifecycle management tools: MLflow, Weights & Biases, Vertex AI, or SageMaker.
  • Knowledge of real-time data streaming (Kafka, Kinesis) for event-driven AI pipelines.
  • Contributions to open-source AI tooling, published technical writing, or talks at AI/ML conferences.
  • Prior startup or scale-up experience — comfortable with ambiguity and building foundational systems from scratch.

🇧🇷 Essa vaga exige inglês. Você está pronto?

A DevSpeak Academy prepara desenvolvedores brasileiros para conquistar vagas internacionais. Domine o inglês técnico com professores que entendem o mundo dev.

Conheça a DevSpeak Academy