Machine Learning Evaluation Specialist - Remote

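Open to candidates in: United States, Bolivia, Ecuador, Paraguay, Venezuela, Brazil, Uruguay, Peru, Argentina, Colombia, Chile, Mexico, Puerto Rico, Canada, Georgia, Hungary, Turkey, Dominican Republic, Estonia, Spain, United Kingdom, Ireland, Slovakia, Romania, Bosnia and Herzegovina, Latvia, Lithuania, Poland, Bulgaria, Portugal, Greece, Albania, Italy, Kosovo, Montenegro, Malta, North Macedonia, Serbia, France, Czech Republic, Germany, Belgium, Austria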
G2i Inc.
🏭 Software Development
πŸ“ Delray Beach, Florida, US
πŸ‘€ 11-50

Important for US applicants: This is a 1099 independent contractor role and is not compatible with F-1 OPT, STEM OPT, or other visa statuses that require W-2 employment, guaranteed hours, or employer sponsorship. We are unable to provide offer letters or employment verification for this role.

Help design the hardest ML problems state-of-the-art AI hasn't solved yet.

We're hiring domain experts to build evaluation tasks that challenge the frontier of AI. This is not an ML engineering role; it's a research role. You'll use deep expertise in your field to create problems that general ML knowledge can't touch.

What you'll do

  • Propose and frame original, research-grade ML problems rooted in your domain

  • Design evaluation tasks that require specialized knowledge well beyond standard pipelines

  • Assess AI-generated solutions for correctness, creativity, and methodological rigor, and explain exactly where and why they fall short

  • Document problem difficulty, required domain knowledge, and expected failure modes

What you need

  • Graduate-level expertise (MS or PhD preferred) in a scientific or technical domain that intersects with ML

  • Strong working knowledge of ML methods: model selection, feature engineering, evaluation metrics

  • Deep familiarity with active research problems in your field: you know where general ML knowledge runs out

  • Excellent written communication: you can articulate complex problems clearly and precisely. This cannot be overstated.

  • Self-motivated and comfortable working independently on intellectually demanding tasks

What you don't need

  • No prior AI training or RLHF experience required

  • No software engineering background needed; domain expertise and research instincts are what matter

Domains we're especially looking for

  • Computational Biology / Bioinformatics

  • Genomics / Molecular Biology

  • Physics / Astrophysics / Signal Processing

  • Climate / Environmental Modeling

  • Healthcare / Medical Imaging

  • Neuroscience / Brain-Computer Interfaces

  • Materials Science / Chemistry

  • Finance / Quantitative Modeling

  • Robotics / Control Systems / Reinforcement Learning

  • Advanced NLP (specialized domains)

  • Mathematics / Statistics (applied)

Logistics

  • Fully remote: work from anywhere within the accepted countries listed above

  • $200–$400/hr depending on domain and seniority

  • 10–40 hrs/week, hourly contract

  • Assessment required (paid if approved)

  • Independent contractor (1099): not compatible with F-1 OPT, STEM OPT, or visa statuses requiring W-2 employment or employer sponsorship

⚠️ This is a project-based, freelance opportunity with no guaranteed hours. We recommend keeping other work options open while waiting for project assignment.
