Role overview
(This role is based in Trepp's NYC office and follows a hybrid work schedule.)
About this Role
As a Senior Data Scientist at Trepp, you will take a leading role in designing, developing, and deploying advanced data science and GenAI solutions that directly support Trepp's analytics and product offerings. You will operate with a high degree of autonomy, owning complex projects end-to-end while partnering closely with product, engineering, and business stakeholders.
This role emphasizes technical leadership, GenAI production expertise, and mentorship, while continuing to contribute hands-on to statistical, machine learning, and applied analytics initiatives. Your work will shape Trepp's next generation of data-driven and AI-powered capabilities in commercial real estate and structured finance.
What you'll work on
- Partner with product managers, engineers, and domain experts to translate ambiguous business problems into scalable, data-driven solutions.
- Architect and implement GenAI applications leveraging large language models (LLMs), retrieval-augmented generation (RAG), embeddings, agentic architectures, and prompt engineering techniques.
- Build, evaluate, and maintain machine learning models using appropriate statistical, ML, and deep learning approaches.
- Ensure model quality through rigorous experimentation, validation, monitoring, and performance evaluation.
- Communicate complex technical concepts, results, and trade-offs clearly to both technical and non-technical stakeholders.
- Provide technical mentorship and guidance to junior data scientists, contributing to best practices and team standards.
- Contribute to the evolution of Trepp's data science and GenAI strategy, tools, and methodologies.
What we're looking for
Education & Experience
- Bachelor's or Master's degree in Data Science, Computer Science, Statistics, Engineering, or a related quantitative field.
- 5+ years of professional data science or applied machine learning experience.
- 2+ years of hands-on experience developing, deploying, and maintaining GenAI solutions in production environments.
Core Technical Skills
- Advanced proficiency in Python for data analysis, machine learning, and GenAI development.
- Strong SQL skills and experience working with large, complex datasets.
- Experience with modern ML libraries and frameworks (e.g., scikit-learn, PyTorch, TensorFlow).
- Proven experience with LLMs and GenAI tooling, including:
  - Prompt engineering and evaluation
  - Embeddings and semantic search
  - Retrieval-augmented generation (RAG) architectures
  - Model APIs and open-source LLM frameworks
- Experience deploying models and GenAI services using cloud platforms (e.g., AWS).
- Proficiency with GitHub and collaborative development workflows.
Professional Skills
- Strong analytical thinking and problem-solving ability, with attention to detail.
- Ability to independently own complex projects and manage ambiguity.
- Excellent verbal and written communication skills, with the ability to tailor messaging to different audiences.
- Ability to balance multiple priorities and work effectively in a fast-paced, cross-functional environment.