Role overview
Job Title: Software AI Engineer/Architect
Location: Santa Clara, CA (onsite preferred, but remote candidates can be considered)
Experience: 8-10 years
Job Type: Contract/FTE
This role requires a deep, end-to-end understanding of how Large Language Models are built, trained, optimized, deployed, and operated.
Candidates must demonstrate hands-on experience beyond consuming hosted LLM APIs, with a strong grasp of the underlying ML theory, system trade-offs, and production realities of AI/ML solutions.
Mandatory Competency Areas (Non-Negotiable)
Foundations of LLMs (How They Actually Work)
Candidates must demonstrate a first-principles understanding, including:
- Transformer architectures (attention, embeddings, positional encoding)
- Tokenization strategies and their impact on cost & performance
- Training vs inference behavior
- Loss functions, pre-training objectives, and alignment techniques (SFT, RLHF)
- Limitations: hallucinations, bias, context collapse, long-range degradation
Candidates must also show a clear understanding of:
- Prompt injection and data leakage risks
- Training data privacy and IP protection
- Model abuse, misuse, and guardrails
- Regulatory and compliance considerations
- Responsible AI principles in production systems