Full Job Description
About the team and the role:
eBay, Inc. seeks Applied Researcher 1 in San Jose, CA
What you will accomplish:
Job Duties: Investigate, prototype, and evaluate state-of-the-art algorithms (from academia and industry) for time-series anomaly detection, forecasting, log pattern recognition, and graph analysis specifically tailored for observability data. Architect and design highly scalable, resilient, and cost-efficient distributed systems for ingesting, processing, storing, and querying massive volumes (petabytes/day) of observability data. Implement and operationalize analytical models and systems that correlate signals across diverse observability data (metrics, logs, traces, events) to facilitate automated or semi-automated root cause analysis of production incidents. Develop and optimize algorithms and systems for analyzing distributed tracing data using graph theory and graph database technologies to understand service dependencies, critical paths, and performance impacts. Design and refine systems for advanced alerting that minimize noise, automatically correlate related events, and deliver enriched context to on-call engineers, potentially using sophisticated data analysis techniques. Engineer and optimize software components for real-time processing, aggregation, and enrichment of high-throughput observability data streams. Potentially contribute improvements or research findings back to relevant open-source observability projects or develop internal frameworks/libraries to standardize observability practices. Create detailed technical documentation for system designs, algorithms, and operational procedures. Present research findings, system designs, and results to internal teams and potentially the broader technical community. Partial telecommuting permitted from within a commutable distance.
What you will bring:
Minimum Requirements: Master's degree, or foreign equivalent, in Computer Science, Information Technology, or a closely related field plus two years of experience in the job offered or a related occupation.
Special Skill Requirements:
1. Designing, building, and analyzing complex distributed systems (2 years)
2. Systems programming and scripting using Go, Java, and Python (2 years)
3. Observability and monitoring using Prometheus, Loki, Jaeger/Tempo, Grafana, and Kubernetes (2 years)
4. Large-scale data storage and querying systems (ClickHouse, Hadoop ecosystem) (2 years)
5. Practical experience in one or more of: time-series analysis, graph algorithms/analytics, statistical modeling and analysis techniques (including areas like anomaly detection, forecasting, pattern recognition/clustering) (2 years)
Salary: $147,200.00-196,500.00 per annum. 40 hours per week; M-F, 9:00 a.m. to 5:00 p.m.
Must be legally authorized to work in the U.S. without sponsorship.
Additional Details
Base pay offered may vary depending on multiple individualized factors, including location, skills, and experience. The total compensation package for this position may also include other elements, including a target bonus and restricted stock units (as applicable) in addition to a full range of medical, financial, and/or other benefits (including 401(k) eligibility and various paid time off benefits, such as PTO and parental leave). Details of participation in these benefit plans will be provided if an employee receives an offer of employment.
If hired, employees will be in an "at-will position" and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
Remote roles are not eligible for U.S. visa sponsorship.