Bridging the Digital-to-Physical Divide: Evaluating LLM Agents on Benchtop DNA Acquisition
This report describes an automatable evaluation of eight frontier large language model (LLM) agents on their ability to design DNA segments, interact with a benchtop DNA synthesizer, and generate laboratory protocols.