Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity4просмотра9 месяцев назад
Extension OL-MDISF: Online Learning from Mix-Typed, Drifted, and Incomplete Streaming Features7просмотров9 месяцев назад
DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering16просмотров10 месяцев назад
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety11просмотров10 месяцев назад
Tree-Structured Parzen Estimator Can Solve Black-Box Combinatorial Optimization More Efficiently9просмотров10 месяцев назад
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination4просмотра10 месяцев назад
Giving AI Agents Access to Cryptocurrency and Smart Contracts Creates New Vectors of AI Harm4просмотра10 месяцев назад
Gemini 2.5: Advancing Reasoning, Multimodality, Long Context, and Agentic Capabilities11просмотров10 месяцев назад
Сбербанк: Главные ИИ-прорывы и укрепление экосистемы во II квартале 2025!15просмотров10 месяцев назад
Working with AI: Measuring the Occupational Implications of Generative AI13просмотров10 месяцев назад
RAG-R1: Incentivizing Search and Reasoning Capabilities of LLMs Through Multi-Query Parallelism41просмотр10 месяцев назад