News

The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes 5,000 “realistic ...
OpenAI, the creator of artificial intelligence chatbot ChatGPT, has a new open-source large language model called HealthBench ...
OpenAI, the maker of ChatGPT, released an open-source benchmark designed to measure the performance and safety of large ...
OpenAI has launched HealthBench, a new dataset designed to test how accurately AI models respond to real-world health care ...
OpenAI has launched HealthBench, a comprehensive dataset to assess the performance of AI models in answering health-related ...
Experts say it improves AI evaluation but warn that more review is needed ...
Experts call it a major step forward, but they also say more work is needed to ensure safety. The dataset — called HealthBench — is OpenAI's first major independent health care project. It includes ...