New Frontier Models Are Faster, Not More…

Apr 29

Overall accuracy for GPT-5.5 and Opus 4.7 remains flat on SpatialBench. Scientist-reviewed trajectories reveal persistent gaps in assay-aware biological judgment.

Read →

3 Comments

Adil Yusuf

May 1

why not provide data analysis instructions/examples that are provided by the assay suppliers where possible? is the expectation that the agents should be able to retrieve this on their own?

Reply (2)

Kenny Workman

May 1

Yes, we construct tasks with the kind of realistic context you would give an experienced scientist.

Adil Yusuf

May 1

this makes sense.

my expectation would be that if the agents have gotten better at general intelligence, then perhaps we would see performance gains when instructions/examples are provided. not specified instructions that the user makes, just the instructions/examples from the assay supplier.

but i agree that the agents should have ideally been able to retrieve this themselves.

LatchBio

New Frontier Models Are Faster, Not More…