We are seeking a Lead AI Engineer to design, build and scale cutting-edge AI applications powered by large language models. In this role, you will partner with clients to deliver tailored LLM-driven solutions, architect agentic systems and drive the adoption of emerging AI technologies across enterprise environments.
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
Responsibilities
-
Design, implement and maintain end-to-end AI applications, including chatbots, Q&A platforms, agent workflows and other LLM-driven solutions
-
Collaborate directly with clients to understand their needs, identify opportunities and recommend tailored AI/LLM solutions that drive business value
-
Architect and optimize robust data pipelines, prompt strategies and datasets to ensure effective, accurate and scalable AI models
-
Evaluate, monitor and refine AI system performance, ensure outputs are accurate, secure, scalable and compliant with industry regulations and best practices
-
Conduct research, design experiments and perform rapid prototyping to validate technical feasibility and demonstrate the business value of AI solutions
-
Stay current with evolving LLM technologies, frameworks, protocols (such as MCP, A2A, ACP) and methodologies, continuously improve solution quality and client outcomes
-
Design and implement agentic systems with frameworks such as LangChain, LangGraph and Semantic Kernel, integrate with vector databases and advanced memory architectures
-
Develop and maintain APIs and system integrations for production-grade AI applications, including enterprise system integration (CRM, ERP, databases)
-
Deploy AI solutions at scale, consider performance, cost-efficiency, maintainability, observability and security (including guardrails and prompt injection prevention)
-
Implement and monitor retrieval systems (keyword search, vector search, embeddings), ranking algorithms and agent evaluation frameworks
-
Use MLOps/AIOps practices for agentic systems and ensure robust observability and monitoring of deployed solutions
-
Clearly communicate complex technical concepts and AI strategies to both technical and non-technical stakeholders, iterate on models based on user feedback
Requirements
-
Strong proficiency in at least one modern programming language (such as Python, Java, C#, Go, etc.); experience with web frameworks like FastAPI or similar is a plus
-
Deep understanding of the AI application development lifecycle, including production deployment, system integration and rapid UI prototyping (Streamlit, Gradio or similar)
-
Familiarity with major LLM platforms and APIs (OpenAI, Anthropic, Amazon Bedrock, Gemini) and related frameworks (LangChain, LangGraph, LlamaIndex, Strands Agents, etc.)
-
Knowledge of advanced AI integration patterns (e.g., RAG, agent orchestration, tool calling), retrieval systems (keyword/vector search, embeddings) and ranking algorithms
-
Experience to deploy AI solutions at scale, with a focus on performance, cost-efficiency, maintainability, observability and security (including guardrails and prompt injection prevention)
-
Proven ability to evaluate generative AI quality with retrieval/classification scores, LLM-based evaluation, agent evaluation metrics and A/B testing
-
Experience with vector databases (Pinecone, Weaviate, ChromaDB, FAISS) and semantic/hybrid search
-
Experience to design experiments, conduct A/B tests and iterate on models based on user feedback
-
Experience with enterprise system integration (CRM, ERP, databases) and deployment to cloud AI platforms or on-premise solutions
-
Experience with observability and monitoring tools/frameworks, and application of MLOps/AIOps practices for agentic systems
-
Familiarity with emerging protocols (MCP, A2A, ACP) and advanced memory architectures
-
Proven experience in AI engineering and delivery of ML-based solutions in production environments
-
Strong problem-solving skills, attention to detail and ability to work independently and collaboratively
-
Excellent communication, collaboration and interpersonal skills, with the ability to explain complex technical concepts to non-technical stakeholders
Technologies
-
Proficiency in at least one modern programming language (e.g., Python, Java, C#, Go, etc.) for AI development
-
Web frameworks: FastAPI, Streamlit, Gradio, Flask, Spring Boot, ASP.NET or similar
-
Major LLM platforms and APIs: OpenAI, Anthropic, Amazon Bedrock, Gemini
-
Agentic frameworks: LangChain, LangGraph, Semantic Kernel, LlamaIndex, Strands Agents
-
Data pipeline and integration tools
-
Vector databases: Qdrant, FAISS, Chroma, Pinecone, Weaviate, ChromaDB
-
Retrieval and ranking systems: keyword search, vector search, embeddings, ranking algorithms
-
Cloud AI platforms: Azure OpenAI, Amazon Bedrock, GCP Vertex AI
-
On-premise solutions: vLLM
-
Enterprise AI platforms: AWS AgentCore, Databricks AgentBricks, Google Agents Space, Azure AI Foundry
-
Observability and monitoring tools/frameworks
-
MLOps/AIOps practices for agentic systems
-
Security and guardrail tools for AI applications
-
Protocols: MCP, A2A, ACP
-
Advanced memory architectures
We offer
-
Connectivity Bonus (25,000 ARS are paid with a salary receipt at the end of each month as a non-wages concept).
-
Medicina Prepaga (It covers the collaborator and direct family group).
-
Paternity Leave (Two additional days are added to what is established by law, total of 4 days).
-
Discounts card.
-
English Training (English lessons, twice per week).
-
Training Program (Access to multiple customized training plans according to the needs of each role within the company).
-
Marriage bonus (The company doubles the allowance established by law that ANSES offers).
-
Referral Program (Referral bonus is paid when the referral of a collaborator joins the Company).
-
External Agreements and Discounts.
-
Vacations: 14 calendar days a year
By applying to our role, you are agreeing that your personal data may be used as in set out in EPAM´s Privacy Notice and Policy.