Role overview
**AI/LLM Engineer**
**3-Month Contract | Hybrid (San Jose, CA)**
W2 ONLY
**Overview:**
Our client is seeking an experienced
**AI/LLM Engineer**
to design, optimize, and deploy large language models for real-world applications. This role combines deep technical expertise in model development with hands-on experience in scalable systems and production integration.
**Responsibilities:**
* **Model Development & Optimization:**
Design, train, fine-tune, and evaluate large language models (LLMs) to improve performance, efficiency, and alignment with business or research objectives.
* **Systems Integration & Deployment:**
Build scalable inference pipelines, optimize serving infrastructure (e.g., quantization, caching, distillation), and integrate models into products, APIs, or services.
* **Research & Collaboration:**
Experiment with new architectures, prompt-engineering techniques, and retrieval systems. Partner with data science, product, and ML operations teams to translate research into production-ready solutions.
**Contract Details:**
* **Duration:**
3-month contract (potential for extension)
* **Location:**
Hybrid — onsite in
**San Jose, CA**