OpenThoughts-Agent: Data Recipes for Agentic Models
World Models in Pieces: Structural Certification for General Agents
Matching Tasks to Objectives: Fine-Tuning and Prompt-Tuning Strategies for Encoder-Decoder Pre-trained Language Models
Grading the Grader: Lessons from Evaluating an Agentic Data Analysis System
Accuracy and Satisfaction in Multi-Turn LLM Dialogues for NFR Assessment
Difference-Making without Making a Difference
Solving Inverse Problems of Chaotic Systems with Bidirectional Conditional Flow Matching
Assessing Distribution Shift in Human Activity Recognition for Domain Generalization
BluTrain: A C++/CUDA Framework for AI Systems
Can Scale Save Us From Plasticity Loss in Large Language Models?
Scaling Laws for Task-Specific LLM Distillation
Decentralised AI Training and Inference with BlockTrain
Cost-Optimal Decision Diagrams for Stochastic Boolean Function Evaluation
LaGO: Latent Action Guidance for Online Reinforcement Learning
CineCap: Structured Reasoning with Spatio-Temporal Anchors for Cinematographic Video Captioning
SAFARI: Scaling Long Horizon Agentic Fault Attribution via Active Investigation
Themis: An explainable AI-enabled framework for Reinforcement Learning with Human Feedback
When CQs Go Wrong: Challenges in CQ Verification with OE-Assist
Abstractions of Queries in Ontology-Based Data Access
AI Tokenomics: The Economics of Tokens, Computation, and Pricing in Foundation Models