I'm an AD

OpenAI Claims AI Models Rival Human Experts in 44 Professions... are you among them?

OpenAI has unveiled a new benchmark, GDPval, which it says shows state-of-the-art artificial intelligence models are now capable of matching or exceeding human experts in 44 distinct knowledge-based occupations across nine major U.S. industries. The evaluation tests models on over 1,300 real-world work tasks, with the results indicating rapid advances in AI’s ability to generate deliverables such as legal briefs, engineering plans, and journalistic content at speeds and costs far outpacing traditional methods.

Tasks were created and assessed by professionals averaging 14 years of industry experience, ensuring that AI outputs were measured against authentic standards of workplace quality. According to OpenAI, Claude Opus 4.1 from Anthropic achieved the highest win and tie rate compared to human experts, while the latest GPT-5 release also demonstrated significant improvement. However, OpenAI notes that real-world jobs involve more than just documented tasks and cautions that workplace implementation remains complex.

Below is the list of the 44 occupations where AI models have demonstrated expert-level performance, as claimed by OpenAI’s GDPval benchmark:

  • Concierges

  • Property, real estate, and community association managers

  • Real estate sales agents

  • Real estate brokers

  • Counter and rental clerks

  • Recreation workers

  • Compliance officers

  • First-line supervisors of police and detectives

  • Administrative services managers

  • Child, family, and school social workers

  • Mechanical engineers

  • Industrial engineers

  • Buyers and purchasing agents

  • Shipping, receiving, and inventory clerks

  • First-line supervisors of production and operating workers

  • Software developers

  • Lawyers

  • Accountants and auditors

  • Computer and information systems managers

  • Project management specialists

  • Registered nurses

  • Nurse practitioners

  • Medical and health services managers

  • First-line supervisors of office and administrative support workers

  • Medical secretaries and administrative assistants

  • Customer service representatives

  • Financial and investment analysts

  • Financial managers

  • Personal financial advisors

  • Securities, commodities, and financial services sales agents

  • Pharmacists

  • First-line supervisors of retail sales workers

  • General and operations managers

  • Private detectives and investigators

  • Sales managers

  • Order clerks

  • First-line supervisors of non-retail sales workers

  • Sales representatives, wholesale and manufacturing (except technical and scientific products)

  • Sales representatives, wholesale and manufacturing (technical and scientific products)

  • Audio and video technicians

  • Producers and directors

  • News analysts, reporters, and journalists

  • Film and video editors

  • Editors.

The release of GDPval highlights growing competitive stakes among leading AI developers as they pursue integration of advanced models into workplace settings and e-commerce, while ongoing debates continue regarding the true return on investment for businesses deploying generative AI at scale.

(Image: generated by Google AI Studio)

Powered by Blogger.