Robots Atlas>ROBOTS ATLAS
GPT-Rosalind

GPT-Rosalind

Specialized OpenAI reasoning model designed for life sciences research, drug discovery, genomics, and chemistry. The first model in the Life Sciences series.
⏳ Preview🏢 EnterpriseReasoning modelScientific AISpecialized AITool-using model
Release date
17 April 2026
Access:APIHostedDeployment:☁ Cloud

Overview

GPT-Rosalind is a specialized reasoning model developed by OpenAI for applications in the life sciences, drug discovery, and translational medicine. It is the first model in a planned Life Sciences series and was introduced on April 17, 2026, as a research preview for select enterprise customers in the United States under the Trusted Access program. The model's name references British chemist and crystallographer Rosalind Franklin, whose X-ray diffraction research contributed to the discovery of the double helix structure of DNA.

The model was built on OpenAI's latest internal models and fine-tuned specifically for deep biological analysis. OpenAI trained it on 50 of the most common biological workflows and integrated access to major public biological databases. GPT-Rosalind supports multi-step scientific workflows, including literature evidence synthesis, biological hypothesis generation, experiment planning, and genomic and biochemical data analysis. It can query specialized databases, process scientific literature, interface with computational tools, and suggest new experimental pathways within a single interface.

The model is available in ChatGPT, Codex, and via the OpenAI API exclusively to qualified enterprise customers in the US who have passed a security review and demonstrated that they conduct legitimate research with clear public benefit. During the research preview phase, usage does not draw from existing credits or tokens. Alongside the model, OpenAI released a free Life Sciences Research Plugin for Codex, available on GitHub, connecting models to over 50 scientific tools and databases — including AlphaFold, Bgee, and BindingDB — covering human genetics, functional genomics, protein structure, biochemistry, and clinical research discoveries.

Partners and applications

Confirmed partners and customers collaborating on the model at launch include Amgen, Moderna, Novo Nordisk, Thermo Fisher Scientific, Oracle Health and Life Sciences, NVIDIA, Allen Institute, Benchling, and UCSF School of Pharmacy. OpenAI is also collaborating with Los Alamos National Laboratory on AI-assisted protein and catalyst design, and Dyno Therapeutics participated in the model evaluation phase. Consulting firms McKinsey & Company, Boston Consulting Group, and Bain & Company are associated with OpenAI's Life Sciences team.

Benchmark results

On the BixBench benchmark — which evaluates bioinformatics tasks and data analysis in real research environments and was developed by Edison Scientific — GPT-Rosalind achieved a Pass@1 score of 0.751, the highest among models with published results. For comparison: GPT-5.4 scored 0.732, Grok 4.2 scored 0.728, GPT-5.2 scored 0.698, GPT-5 scored 0.611, and Gemini 3.1 Pro scored 0.550 (OpenAI data). On the LABBench2 benchmark — covering literature search, database access, sequence manipulation, and protocol design — GPT-Rosalind outperformed GPT-5.4 on 6 of 11 tasks, with the largest margin in the CloningQA category, which requires comprehensive design of DNA and enzymatic reagents for molecular cloning protocols. In OpenAI's internal tests, the model outperformed GPT-5, GPT-5.2, and GPT-5.4 across all five categories: chemistry, biochemistry and protein understanding, phylogenetics, experiment design and analysis, and tool use.

In an evaluation conducted in collaboration with Dyno Therapeutics on unpublished, never-before-seen RNA sequences — to exclude training data memorization — the model's best-of-ten submissions ranked above the 95th percentile of human experts on a sequence-to-function prediction task and at the 84th percentile on sequence generation. This evaluation was benchmarked against a set of 57 historical results from AI and biology experts.

Access and safety

Given the model's dual-use potential regarding biological threats, OpenAI has imposed strict access restrictions. Organizations applying for access must pass a security review and demonstrate three conditions: conducting legitimate scientific research with clear public benefit, having appropriate governance, compliance, and misuse-prevention procedures in place, and restricting access exclusively to approved users in secure, managed environments. OpenAI has built systems into the model for detecting potentially dangerous activities. Pricing and broad availability will be announced after the research preview phase concludes.

Classification
Reasoning modelScientific AISpecialized AITool-using model
Access & deployment
APIHosted
Cloud
Weights: Closed
Key parameters
Tools
📥 Input: text, structured data, documents

Technical specification

Features:Tool use
Modalities
⬇ Input
textstructured_datadocuments
⬆ Output
textstructured_datacode

Capabilities and applications

Native model capabilities
Reasoning
The model's ability to reason logically and solve complex problems.
Category: reasoning
Multi-step reasoning
Carrying out multi-step chains of reasoning across long, complex tasks.
Category: reasoning
Planning
Forming and executing action plans for complex tasks.
Category: planning
Structured output
Producing data in structured formats such as JSON.
Category: structured_generation
Function Calling
Category: planning

Benchmark results

4 benchmarks
BixBench
Pass@1
0.751
📅 17 Apr 2026📄 OpenAI (raport własny, ogłoszenie premiery modelu)
A bioinformatics and data analysis benchmark developed by Edison Scientific, evaluating models on real-world bioinformatics tasks. Comparative results per OpenAI data: GPT-5.4 — 0.732; Grok 4.2 — 0.728; GPT-5.2 — 0.698; GPT-5 — 0.611; Gemini 3.1 Pro — 0.550.
LABBench2
Liczba zadań z wynikiem wyższym niż GPT-5.4 (z 11 łącznie)
6/11 zadań powyżej GPT-5.4
📅 17 Apr 2026📄 OpenAI (raport własny, ogłoszenie premiery modelu)
The benchmark covers literature review, database access, sequence manipulation, and protocol design. The largest improvement was recorded in the CloningQA category (comprehensive design of DNA and enzymatic reagents for molecular cloning protocols).
Dyno Therapeutics RNA Evaluation
Percentyl relative to human experts (best-of-ten submissions) · Evaluation on unpublished RNA sequences (not contaminated with training data); comparison against 57 historical results from human AI-biology experts.
powyżej 95. percentyla (predykcja); 84. percentyl (generowanie sekwencji)
📅 17 Apr 2026📄 OpenAI i Dyno Therapeutics (ocena zewnętrzna, raport własny)
Tasks: RNA sequence–function prediction and sequence generation. Best-of-ten — selection of the best result from 10 model submissions.
Wewnętrzny benchmark OpenAI (5 kategorii naukowych)
Relative ranking
Najwyższy wynik spośród GPT-5, GPT-5.2, GPT-5.4 we wszystkich 5 kategoriach
📅 17 Apr 2026📄 OpenAI (raport własny, ogłoszenie premiery modelu)
Categories: chemistry, biochemistry and protein understanding, phylogenetics, experiment design and analysis, tool use. Largest performance gaps in experiment design and chemistry.

Technical architecture

Core Architecture
Training Techniques

Deployment and security

🔒 Security / Enterprise
✓ Verified enterprise information

The model is available exclusively to qualified corporate clients in the US under the Trusted Access program. Organizations must pass a security review covering three criteria: (1) conducting legitimate scientific research with clear public benefit, (2) having appropriate governance, regulatory compliance, and abuse prevention procedures, and (3) restricting access to approved users in secure environments. OpenAI has built in systems to detect potentially harmful activity. The access restrictions are a direct response to researcher warnings about the potential misuse of biological models for designing dangerous pathogens.

Organizations requesting access should use the form: https://openai.com/form/life-sciences-access/
Updated: 18 Apr 2026↗ Security documentation