aidevblogs
⌘K
BlogsVideosTweets
AllLLMsComputer VisionMLOpsAgentsData EngineeringResearchSafety
Reward Hacking in Reinforcement Learning
Lilian Weng·lilianweng.github.io·about 1 year ago·LLMs
Extrinsic Hallucinations in LLMs
Lilian Weng·lilianweng.github.io·over 1 year ago·LLMs
Thinking about High-Quality Human Data
Lilian Weng·lilianweng.github.io·almost 2 years ago·LLMs
Adversarial Attacks on LLMs
Lilian Weng·lilianweng.github.io·about 2 years ago·LLMs
LLM Powered Autonomous Agents
Lilian Weng·lilianweng.github.io·over 2 years ago·LLMs
Prompt Engineering
Lilian Weng·lilianweng.github.io·almost 3 years ago·LLMs
The Transformer Family Version 2.0
Lilian Weng·lilianweng.github.io·almost 3 years ago·LLMs
Generalized Visual Language Models
Lilian Weng·lilianweng.github.io·over 3 years ago·LLMs
Learning with not Enough Data Part 3: Data Generation
Lilian Weng·lilianweng.github.io·over 3 years ago·LLMs
Reducing Toxicity in Language Models
Lilian Weng·lilianweng.github.io·almost 5 years ago·LLMs
Controllable Neural Text Generation
Lilian Weng·lilianweng.github.io·almost 5 years ago·LLMs