Welocalize, Inc.

Data Labeling Analyst - Speech & Voice AI

Welocalize, Inc.
$70K - $95K *
Consumer Technology
Less than 5 years of experience
Job Overview by Ladders

Qualifications

  • Bachelor's degree in Computer Science, Data Science, Linguistics, Computational Linguistics, or a related field.
  • Ability to work in a fast-paced, collaborative environment.
  • Excellent communication skills.
  • Familiarity with command-line tools and interfaces.
  • Strong analytical skills to identify patterns and anomalies.

Responsibilities

  • Update training and test model databases with new synthetic textual and image data.
  • Modify and refine machine learning data creation, annotation, and rating guidelines.
  • Initiate model training processes using internal tools and command-line interfaces.
  • Evaluate the performance of trained models for deployment readiness.
  • Design and develop training datasets based on project criteria.
  • Engage in data relevance tasks to align datasets with project goals.
  • Conduct manual quality analysis of model results and report findings.

Benefits

  • Engagement in innovative machine learning projects with wide-ranging applications.
  • Opportunity to work within a collaborative team environment.
  • Chance to develop and refine skills in machine learning and NLP within a supportive setting.
  • Exposure to various data management and quality assurance techniques.
  • Potential for future projects involving translation and multi-lingual datasets.

Full Job Description

Job Responsibilities:

The ideal candidate will have a foundational understanding of machine learning, data annotation, quality assurance, and natural language processing. They will play a pivotal role in updating our machine learning models and ensuring their efficacy.

MAIN TASKS & RESPONSIBILITIES

Machine Learning Model Updates:
  • Update training and test model databases with new or amended synthetic textual and image data.
  • Modify and refine machine learning data creation, annotation, and rating guidelines.


Model Training and Evaluation:
  • Initiate model training processes using internal tools and command-line interfaces.
  • Evaluate the performance of trained models to gauge their efficacy and readiness for deployment.


Data Management and Annotation:
  • Design and develop test and training datasets according to the criteria provided by the project manager and other full-time employees.
  • Handle data efficiently, ensuring its integrity throughout the workflow.
  • Engage in data relevance tasks, ensuring datasets are aligned with project goals.
  • Annotate data accurately, ensuring it adheres to set guidelines.


Quality Assurance and Analysis:
  • Conduct manual quality analysis of model results.
  • Recognize error patterns and report anomalies for further investigation.
  • Deliver detailed reports on findings, covering utterance quality, LLM evaluation, ASR bug tracking, and customer pain points, for review by the User Experience Research team.
  • Implement basic quality control measures and ensure the reliability of processed data.
  • Utilize intermediate data analysis techniques to extract insights and inform decision-making.
  • Arbitrate discrepancies effectively, ensuring consistent data quality.


Linguistic and NLP Tasks:
  • Apply basic knowledge of natural language processing and linguistics to data processing tasks.
  • Ensure linguistic accuracy in all processed and annotated data.


REQUIREMENTS

Preferred Qualifications:
  • Bachelor's degree in Computer Science, Data Science, Linguistics, Computational Linguistics, or a related field.


Experience:
  • Ability to work in a fast-paced, collaborative environment.
  • Excellent communication skills.


Skills & Knowledge:
  • Familiarity with command-line tools and interfaces.
  • Strong analytical skills with the ability to identify patterns and anomalies.


Additional Information:

This role focuses primarily on US English datasets; however, familiarity with translation or multilingual datasets can be a plus for future projects.

About Welocalize, Inc.

Welocalize, Inc. is a global translation and localization company that provides language services to businesses and organizations around the world. The company was founded in 1997 and has since grown to become one of the largest language service providers in the world. Welocalize offers a wide range of services, including translation, interpretation, localization, and language consulting. The company has a team of over 1,500 professionals and operates in over 250 languages. Welocalize is committed to providing high-quality language services to its clients and has implemented several quality control measures to ensure accuracy and consistency. The company is privately held and headquartered in Baltimore, MD.
Size: 1,500 employees
Founded: 1997
