aidevblogs
⌘K
BlogsVideosTweets
AllLLMsComputer VisionMLOpsAgentsData EngineeringResearchSafety
BitRL-Light: 1-bit LLM Agents with Deep Reinforcement Learning for Energy-Efficient Smart Home Lighting Optimization
arXiv CS.AI·arxiv.org·1 day ago·LLMs
MegaRAG: Multimodal Knowledge Graph-Based Retrieval Augmented Generation
arXiv CS.AI·arxiv.org·1 day ago·LLMs
MicroProbe: Efficient Reliability Assessment for Foundation Models with Minimal Data
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Erkang-Diagnosis-1.1 Technical Report
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning
arXiv CS.AI·arxiv.org·1 day ago·LLMs
AIAuditTrack: A Framework for AI Security system
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Memory Bear AI A Breakthrough from Memory to Cognition Toward Artificial General Intelligence
arXiv CS.AI·arxiv.org·1 day ago·LLMs
AI-Driven Decision-Making System for Hiring Process
arXiv CS.AI·arxiv.org·1 day ago·LLMs
From Fake Focus to Real Precision: Confusion-Driven Adversarial Attention Learning in Transformers
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Quantifying Laziness, Decoding Suboptimality, and Context Degradation in Large Language Models
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Eidoku: A Neuro-Symbolic Verification Gate for LLM Reasoning via Structural Constraint Satisfaction
arXiv CS.AI·arxiv.org·1 day ago·LLMs
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
arXiv CS.AI·arxiv.org·1 day ago·LLMs
A Benchmark for Evaluating Outcome-Driven Constraint Violations in Autonomous AI Agents
arXiv CS.AI·arxiv.org·1 day ago·LLMs
Safety Alignment of LMs via Non-cooperative Games
arXiv CS.AI·arxiv.org·1 day ago·LLMs