Full Job Description
The Associated Press' Metadata and Data Science Team seeks a Data Scientist based in New York, NY.
Why this role matters:
The Data Scientist will design and implement data science and applied machine learning solutions supporting new product development, search and discovery on platform, content enrichment and metadata generation. As a member of cross-functional project teams, the Data Scientist will perform data analysis, evaluate commercial and open-source models, and deliver solutions with real-world impact.
The team works closely with various departments and functions across the organization to design, implement and manage end-to-end content metadata, to maintain the integrity of schema standards, and to build solutions with data, analytics and machine learning methods.
What you will do:
3 Evaluate, fine-tune, and maintain statistical and machine learning models used in run-time production environments, measuring and communicating performance improvements to stakeholders3 Partner with cross-functional teams to design and optimize AI/ML solutions that deliver new product capabilities and internal workflow improvements, using news articles, photos, videos, election results, and other news data3 Research, evaluate, and recommend models and methodologies across the AI/ML landscape, presenting recommended solutions to technical and non-technical stakeholders3 Identify and address gaps in model quality and performance metrics, synthesizing findings into clear, actionable recommendations3 Contribute to the design and enhancement of data and ML pipelines, including multimodal embedding generation and knowledge extraction, with a focus on accuracy, efficiency, and scalability3 Design user-centered solutions and search algorithms focused on quality and performance3 Stay current with emerging technologies and advances in NLP, machine learning, and data science, proactively surfacing opportunities for improvement3 Support the full model development lifecycle, from problem definition and prototyping through integration, deployment, monitoring, and iteration3 Communicate analysis and present findings clearly, adapting to a range of technical and business audiences
Who you are:
3 3+ years of relevant data science experience, with strong proficiency in Python including NumPy, Pandas, and large-scale semi-structured JSON data3 Bachelor's degree in Data Science or Computer Science3 Experienced applying core machine learning methods including classification, clustering, regression, and ranking3 Hands-on experience with NLP techniques such as entity recognition, disambiguation, semantic similarity, and embedding-based retrieval3 Experience with transformer models for structured extraction, classification, summarization, and generation3 Experience with hybrid search algorithms, retrieval pipelines, intent detection, query expansion, and relevance tuning in Elasticsearch or OpenSearch3 Experience working with both language and multimodal models3 Experience and comfort working with real-world data, including text and visuals, at scale3 Familiar with ML engineering and ML Ops practices, with a track record of delivering runtime solutions3 Familiarity with cohort analysis, session segmentation, A/B testing, and confidence calibration3 Analytical and curious, with strong problem-solving skills and a practical focus on high-impact, cost-aware solutions3 Able to effectively manage multiple project deliverables simultaneously3 Comfortable being accountable for deliverables across the full product development lifecycle, from problem definition through launch and iteration3 An effective communicator who can tailor analysis and presentations to both technical and non-technical audiences3 Collaborative and empathetic, with a genuine focus on user impact and a desire to grow data literacy across the organization3 Advanced-level professional competency in written and spoken English3 Authorization to work in the United States for any employer
What will set you apart:
3 Experience in news media or working with news as data strongly preferred3 Master's degree in Data Science or a related field3 Familiarity with graph data models and designing entity-relationship schemas3 Eagerness to learn the technical nuances of large-scale media operations and identify opportunities within evolving systems
Location:
This role is based in New York City with a hybrid work schedule. AP employees are onsite three days a week, Tuesday, Wednesday and Thursday. Local candidates are preferred, but all qualified applicants are encouraged to apply.
Why join us:
3 A mission-driven, inclusive environment focused on both individual and collective success.3 Opportunities for professional development to help you reach your career goals.3 Access to tools, mentorship, and resources tailored to elevate your proficiency and contributions.
Salary & Benefits:
The anticipated salary range for this position is $116,000 - $160,000, based on a candidate's skills, qualifications, and location. The Associated Press offers comprehensive benefits, which include:
3 Competitive medical, dental and vision coverage3 Retirement benefits3 Company paid life insurance3 Paid vacation and sick days3 Paid parental leave for any new parent3 Mental well-being resources
Deadline for applications is 11:59pm ET on June 22, 2026.