California Institute of Technology

Data Scientist and Application Developer

California Institute of Technology · Pasadena, CA, US · $109k - $134k

Actively hiring · Posted 13 days ago

Role overview

The Einstein Papers Project at Caltech has an opening for a Data Scientist and Application Developer. Join the team that researches and publishes Albert Einstein’s published and unpublished writings; his letters and those of his colleagues, family, and friends; and his calculation drafts, notebooks, and interviews.

In this role, you will work with a vibrant team of scientists, historians, and philosophers of science to carry out text and image processing, extracting relevant information from printed sources, photographs, and handwritten manuscripts. The major source of data consists of some 100,000 archival records and tens of thousands of printed sources. We are interested in archive services and analysis tools for discovering and characterizing specific search objects (text, equations, concepts).

What you'll work on

  • Work with team members to develop systems that ingest, process, and deliver content and tools to users, using modern methodologies and AI software, particularly tools developed to support digital humanities research, methods, and platforms.
  • Develop and improve infrastructure to ingest and organize database records and their associated files (JPG, TIFF, PDF).
  • Automate and improve the management of our repositories (moving data to GitHub or another cloud-based repository).
  • Streamline deployment of software into test and production environments through continuous integration, and troubleshoot runtime issues.

What we're looking for

  • Knowledge of Adobe products (Photoshop, InDesign) and Adobe FrameMaker (or willingness to learn).
  • Production experience with AWS.
  • Experience with configuration management and Git/GitHub.
  • Strong Linux/UNIX skills.
  • Experience with SQL.
  • Ability to read German is a great plus.

Tags & focus areas

Full-time · Data Science · AI