Role overview
Seniority:
Senior
Location:
India
Language:
English
We are looking for a Data Scientist – Data Engineering & LLM Ops. Our client is a leading global organization, and you will be part of an international project team.
What we're looking for
- Ingestion Pipeline Development: Design and maintain robust data pipelines (Python/Snowflake) to ingest data from diverse sources including RSS feeds, web scraping, and API integrations.
- Snowflake Architecture: Manage the “References” raw data table and ensure seamless data flow between the ingestion layer and the OutSystems-based curation app.
- LLM Integration & Refinement: Implement and fine-tune Large Language Models (LLMs) to categorize, summarize, and extract “Reasoning Modelling” insights from raw policy text.
- LLM Ops: Establish monitoring for model performance, hallucination detection, and iterative feedback loops based on SME curation.
- Scalability: Support the migration to a 24-hour automated cycle within the Snowflake infrastructure.
- Languages: Expert-level Python (specifically for data manipulation and AI orchestration).
- Data Engineering: Strong experience with Snowflake and building ETL/ELT pipelines.
- AI/NLP: Hands-on experience with LLM frameworks and prompt engineering.
- Web Ingestion: Proficiency in handling RSS feeds, structured/unstructured web data, and API-based data collection.
- DevOps/LLM Ops: Familiarity with version control (Git) and deploying AI models into production environments.
Perks and benefits:
Career growth – access professional development through partnerships with Microsoft, AWS, Snowflake, Salesforce, and more.
Project diversity – opportunity to work with global clients across sectors and technologies.
Flexibility – adaptable working hours and remote/hybrid work options.
Engaging community – participate in team-building events, hackathons, and CSR initiatives.
Team-building traditions – including our annual company event.