aidevblogs
⌘K
BlogsVideosTweets
AllLLMsComputer VisionMLOpsAgentsData EngineeringResearchSafety
Why We Think
Lilian Weng·lilianweng.github.io·8 months ago·Research
Reward Hacking in Reinforcement Learning
Lilian Weng·lilianweng.github.io·about 1 year ago·LLMs
Extrinsic Hallucinations in LLMs
Lilian Weng·lilianweng.github.io·over 1 year ago·LLMs
Diffusion Models for Video Generation
Lilian Weng·lilianweng.github.io·over 1 year ago·Computer Vision
Thinking about High-Quality Human Data
Lilian Weng·lilianweng.github.io·almost 2 years ago·LLMs
Adversarial Attacks on LLMs
Lilian Weng·lilianweng.github.io·about 2 years ago·LLMs
LLM Powered Autonomous Agents
Lilian Weng·lilianweng.github.io·over 2 years ago·LLMs
Prompt Engineering
Lilian Weng·lilianweng.github.io·almost 3 years ago·LLMs
The Transformer Family Version 2.0
Lilian Weng·lilianweng.github.io·almost 3 years ago·LLMs
Large Transformer Model Inference Optimization
Lilian Weng·lilianweng.github.io·almost 3 years ago·MLOps
Some Math behind Neural Tangent Kernel
Lilian Weng·lilianweng.github.io·over 3 years ago·MLOps
Generalized Visual Language Models
Lilian Weng·lilianweng.github.io·over 3 years ago·LLMs
Learning with not Enough Data Part 3: Data Generation
Lilian Weng·lilianweng.github.io·over 3 years ago·LLMs
Learning with not Enough Data Part 2: Active Learning
Lilian Weng·lilianweng.github.io·almost 4 years ago·Data Engineering
Learning with not Enough Data Part 1: Semi-Supervised Learning
Lilian Weng·lilianweng.github.io·about 4 years ago·Data Engineering
How to Train Really Large Models on Many GPUs?
Lilian Weng·lilianweng.github.io·over 4 years ago·MLOps
What are Diffusion Models?
Lilian Weng·lilianweng.github.io·over 4 years ago·Computer Vision
Contrastive Representation Learning
Lilian Weng·lilianweng.github.io·over 4 years ago·Computer Vision
Reducing Toxicity in Language Models
Lilian Weng·lilianweng.github.io·almost 5 years ago·LLMs
Controllable Neural Text Generation
Lilian Weng·lilianweng.github.io·almost 5 years ago·LLMs