Senior ML Engineer(D.O) - Remote

settings_input_antenna Remote

KỸ NĂNG

Machine Learning

MÔ TẢ CÔNG VIỆC

- Develop, train, and optimize machine learning models using CatBoost, LightGBM, or XGBoost for production environments.
- Design and improve Learning-to-Rank models using pairwise/listwise ranking approaches and ranking evaluation metrics such as NDCG and MAP.
- Analyze and improve probability calibration performance using techniques such as isotonic regression, Platt scaling, Beta calibration, and Venn-Abers predictors.
- Detect, troubleshoot, and resolve train-serve skew and data leakage issues in real-world ML systems.
- Build robust feature engineering pipelines for time-series and event-based datasets with proper temporal validation strategies.
- Collaborate with cross-functional teams to deploy, monitor, and maintain ML models in production environments.
- Work with large-scale datasets using Python, pandas, PyArrow, PostgreSQL, and related ML tooling.
- Contribute to model evaluation, experimentation, and continuous improvement processes.
- Participate in technical discussions, documentation, and weekly review meetings in English.

YÊU CẦU CÔNG VIỆC

- 5+ years of professional ML experience, including 2+ years training and shipping gradient boosted models in production (CatBoost, LightGBM, or XGBoost).
- Hands-on experience with Learning-to-Rank (pairwise/listwise objectives, NDCG, MAP) on real datasets — not only classification.
- Solid grasp of probability calibration: ECE, Brier score, reliability diagrams, isotonic / Platt / Beta / Venn-Abers, and the difference between sharpness and calibration.
- Proven track record of diagnosing and fixing train-serve skew in a real system (please be ready to describe a specific incident in the interview).
- Strong time-leakage awareness for time-series / event data: target encoding, rolling features, pedigree-style features, and how to validate them with proper temporal splits.
- Fluent in Python and the standard scientific stack (NumPy, pandas, scikit-learn, PyArrow). Comfortable in a UNIX/Docker environment.
- Working English (written and verbal) sufficient to read/write technical reports and join weekly review calls.

2. Nice to have:
- Experience with sports / pari-mutuel / betting models, or any odds-driven domain.
- Familiarity with handicapping concepts: Quirin speed, pace figures, workout features, class ratings.
- Experience with conformal prediction, Venn-Abers predictors, or distribution-free calibration.
- Bayesian feature engineering experience (used offline as feature generators only, not online).
- Open-source contributions to CatBoost / scikit-learn / calibration libraries.

3. Tech stack:
Python 3.11+, CatBoost, scikit-learn, pandas, PyArrow, FastAPI (serving side, read-only for this role), Docker, Git, uv. Some
PostgreSQL for race data. CI runs on GitHub Actions.

QUYỀN LỢI

- Receive 100% salary from the onboarding date.
- Participate in company activities: Teambuilding, travel and other activities.
- Work with large and advanced systems, have the opportunity to develop comprehensive technology skills with complex problems, requiring high accuracy.

MỨC LƯƠNG

upto 40 triệu

work Loại hình làm việc :

Remote

event Hạn ứng tuyển:

22/05/2026

date_range Kinh nghiệm:

3 năm

school Học vấn:

Không yêu cầu

people Số lượng:

switch_account Cấp bậc:

Senior

Hỗ trợ ứng tuyển

Nguyễn Thị Vân Anh

anhntv1@hatonet.com

Việc khác cùng kỹ năng

Việc làm mà bạn đang tìm kiếm không khả dụng ,
Hãy thử đăng ký tài khoản Freelancer tại Hatonet để tìm kiếm thêm nhiều cơ hội khác từ các doanh nghiệp trên toàn thế giới Đăng ký tài khoản Freelancer

Kéo và thả hoặc chọn file để tải lên

PDF, EXCEL, DOCX (10MB)