Senior Data Scientist Job at SES, Remote

  • SES
  • Remote

Job Description

Senior Data Scientist, Natural Language Processing and Data Annotation Expert

About us

At SES AI, we are at the forefront of revolutionizing lithium-metal battery creation with our groundbreaking approach that integrates cutting-edge machine learning techniques into our research and development processes. Our mission is to lead the next wave of scientific discovery in material science, powered by advanced AI technologies with a dedication to AI for Science.

To learn more about SES, please visit:

Position Scope

We are seeking a senior Data Scientist specializing in Natural Language Processing (NLP) and Data Annotation to spearhead our innovative projects. The ideal candidate will possess exceptional expertise in NLP, using state-of-the-art multimodal language models to retrieve and extract intricate chemical information in material and battery science. We are also looking for an individual with a profound understanding of designing language-based data labeling pipelines to extract scientific chains of thought for advanced reasoning and discovery. This pivotal role will involve leading the creation of labeled datasets crucial for training our cutting-edge language models and AI agents.

This role will be remote.

Responsibilities

  • Lead the design and implementation of advanced NLP techniques and methodologies to extract intricate scientific concepts and reasoning from vast textual sources.
  • Lead the design and implementation of advanced NLP techniques and methodologies to extract chemical information including SMILES notations, properties, and interleaved text for multimodal language model training and chemical property predictions.
  • Develop and refine language-based data labeling pipelines tailored for scientific discovery, ensuring high-quality annotated datasets for training large language models and AI agents.
  • Collaborate closely with cross-functional teams to identify key research areas and define labeling strategies to capture nuanced scientific insights effectively.
  • Spearhead the development of innovative approaches for data annotation, incorporating state-of-the-art NLP algorithms to enhance accuracy and efficiency.
  • Provide expert guidance on data annotation best practices, ensuring consistency and quality across labeled datasets.
  • Conduct thorough analyses to evaluate the effectiveness of labeling pipelines and make continuous improvements to optimize performance.
  • Stay abreast of the latest advancements in NLP and data annotation techniques, integrating emerging methodologies to enhance our data labeling capabilities.

Preferred Qualifications

  • Experience with AI agent studies, using knowledge-based Retrieval-Augmented Generation (RAG) to improve the accuracy of language generation.
  • Experience with cloud computing platforms and services (e.g., AWS, Azure, Google Cloud) for scalable data processing and storage.
  • Knowledge of data visualization techniques and tools for exploring and presenting scientific insights.

Qualifications

  • Advanced degree (master's or PhD preferred) in computer science, data science, or a related field.
  • Extensive hands-on experience in natural language processing, with a strong emphasis on designing and implementing language-based data labeling pipelines.
  • Proven track record of leveraging NLP techniques to extract complex scientific concepts and reasoning from textual sources.
  • Familiarity with deep learning models and architectures for NLP tasks, such as transformer-based models (e.g., BERT, GPT).
  • Proficiency with Git and Linux-based systems, and in programming languages such as Python, R, or Java, along with expertise in relevant libraries and frameworks (e.g., PyTorch, NLTK, TensorFlow).
  • Exceptional problem-solving skills with meticulous attention to detail, coupled with a passion for advancing scientific discovery through data science.
  • Excellent communication and collaboration skills, with the ability to effectively convey complex technical concepts to diverse stakeholders.

Job Tags

Remote job
