Role overview
Apexon is seeking an experienced
AI/ML Engineer
with strong expertise in
LLM development, MLOps, and building scalable GenAI solutions
. You will design, build, and operationalize AI/ML systems that support enterprise clients across healthcare, BFSI, retail, and digital transformation engagements.
The ideal candidate has hands-on experience building
end-to-end machine learning pipelines
, optimizing
large language model workflows
, and deploying
secure ML systems
in production environments.
*Responsibilities
LLM & AI Solution Development**
- Build, fine-tune, evaluate, and optimize Large Language Models (LLMs) for client-specific use cases such as document intelligence, chatbot automation, code generation, and workflow orchestration.
- Develop RAG (Retrieval-Augmented Generation) pipelines using enterprise knowledge bases.
- Implement prompt engineering, guardrails, hallucination reduction strategies, and safety frameworks.
- Work with transformer-based architectures (GPT, LLaMA, Mistral, Falcon, etc.) and develop optimized model variants for low-latency and cost-efficient inference.
What we're looking for
- Experience implementing Generative AI solutions for enterprise clients.
- Expertise in distributed training, quantization, optimization, and GPU acceleration.
- Experience with:
- Vector Databases (Pinecone, Weaviate, FAISS)
- RAG frameworks (LangChain, LlamaIndex)
- Monitoring tools (Prometheus, Grafana, CloudWatch)
- Understanding of model governance, fairness evaluation, and client compliance frameworks.