AI Benchmarking & Evaluation Engineer
Join a team at the forefront of AI model evaluation, setting the standard for how large language models are tested and validated. In this role, you'll assess the latest AI models, design new benchmarks, and develop advanced evaluation methodologies. You'll work closely with engineers, AI researchers, and enterprise clients to ensure cutting-edge AI systems meet the highest standards. This role is a bridge between research and practical implementation and will suit someone who enjoys taking academic papers and creating working models.
Key Responsibilities:
Who You Are:
Make a real impact in AI research and developmentâapply today!
#128640; We're Hiring: Sr. User Experience Designer On-Site in Plano, TX! Are you a creative problem-solver whos passionate about user-centered design and driven to make a real impact? Our Client, a global leader in the restaurant and hospitality space (think iconic...
...Job Description Indie88 Radio is adding to our Toronto Sales Team. We are looking to hire an additional Local Radio Sales Rep who... ...brands of all sizes. Indie88 is an independent Toronto radio station with a passion for music and creating an enjoyable work environment...
...Description Position Summary: We are hiring a full time Creative Arts Therapist (LCAT) to join our interdisciplinary team at Vida Guidance... ...and make changes. Provides counseling to family members to assist with understanding and supporting patients. Engages in...
...PRIDE Health is seeking a travel nurse RN Dialysis for a travel nursing job in Joliet, Illinois. Job Description & Requirements ~ Specialty: Dialysis ~ Discipline: RN ~ Start Date: 06/23/2025~ Duration: 13 weeks ~36 hours per week ~ Shift: 12 hours, days...
...Baptist Health System - San Antonio TX is seeking a Registered Nurse (RN) Pediatrics for a nursing job in San Antonio, Texas. Job Description & Requirements ~ Specialty: Pediatrics ~ Discipline: RN ~ Duration: Ongoing ~36 hours per week ~ Shift: 12 hours...