Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation9просмотров6 месяцев назад
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control7просмотров6 месяцев назад
BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design8просмотров6 месяцев назад
GOEDEL-PROVER-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction2просмотра6 месяцев назад