Requisition ID R9680

Data Scientist

The Data Scientist works on real-world problems in natural language processing (NLP) and speech/voice. This includes extending and improving on NLP models and methods using pooled language/voice data from SIL and its partners (in 600+ languages). The output of this work will be applied by SIL, other NGOs, and commercial entities in technology/content creation for local language communities.

Job Description

We are looking for a Data Scientist to work on real-world problems in natural language processing (NLP) and speech/voice. This includes extending and improving on NLP models and methods using pooled language/voice data from SIL and its partners (in 600+ languages). The output of this work will be applied by SIL, other NGOs, and commercial entities in technology/content creation for local language communities.

Participate in research in machine intelligence and machine learning applications

Develop solutions for real-world natural language processing problems

Help curate and pre-process pooled data from SIL and its partners so that it can be utilized in NLP/ML/AI research and development

Advise and collaborate with SIL Data Scientist(s)

Knowledge

  • Foundational math, especially statistics, calculus, and linear algebra
  • "Classical" NLP tasks and tools (such as syntactic and semantic parsing, semantic relation extraction, co-reference resolution) and/or "deep-learning style" NLP (such as RNNs, CNNs, attention-based models, and word embeddings)

Skills

  • Write code in at least one programming language, with a desire to work with Python and ML frameworks such as PyTorch or TensorFlow
  • Interface with infrastructure (e.g., via the command line, cloud consoles, Bash, REST APIs, etc.)
  • Effectively communicate with both technical and non-technical teams

Attitudes

  • Committed to upholding professional standards
  • Demonstrating the highest level of ethical behavior
  • Interacting positively and collaborating as a member or leader of a team, with respect toward various differing perspectives
  • Able to work and communicate effectively across cultures
  • Doing the right things in the right way for the right reasons
  • Consistently ready to learn and grow

Education

  • Undergraduate degree in a quantitative field (e.g., Computer Science, Engineering, Physics, Statistics) or equivalent experience is preferred

Experience

  • Hands-on experience implementing or utilizing NLP methods for tasks such as NER, sentiment analysis, machine translation, tokenization, and PoS tagging is strongly desired

  • Hands-on experience with Python and its suite of NLP tools (NLTK, spaCy, etc.) is strongly desired

About SIL

Dallas, TX - Founded in 1934

SIL is a global, faith-based nonprofit that works with local communities around the world to develop language solutions that expand possibilities for a better life.

Our faith inspires and informs our commitment to expand possibilities for people to thrive. We believe all people are created by God and given language as a means for flourishing. Through language, we understand who we are, experience relationships and explore life’s most important questions.