aidevblogs
⌘K
BlogsVideosTweets
AllLLMsComputer VisionMLOpsAgentsData EngineeringResearchSafety
Introducing OpenAI Academy for News Organizations
OpenAI Blog·openai.com·9 days ago·MLOps
Opening the black box of character training
Interconnects (Nathan Lambert)·interconnects.ai·about 2 months ago·MLOps
xAI’s Colossus 2 – First Gigawatt Datacenter In The World, Unique RL Methodology, Capital Raise
SemiAnalysis·semianalysis.com·3 months ago·MLOps
Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack
SemiAnalysis·semianalysis.com·4 months ago·MLOps
H100 vs GB200 NVL72 Training Benchmarks – Power, TCO, and Reliability Analysis, Software Improvement Over Time
SemiAnalysis·semianalysis.com·4 months ago·MLOps
Sparse Networks from Scratch: Faster Training without Losing Performance
Tim Dettmers·timdettmers.com·over 6 years ago·MLOps
Calculus on Computational Graphs: Backpropagation
Chris Olah·colah.github.io·over 10 years ago·MLOps
Mapping NVIDIA’s Full GenAI Toolchain
MLOps Community·mlops.community·16 days ago·MLOps
Accelerating DenseNet-121 Inference NVIDIA
MLOps Community·mlops.community·about 1 month ago·MLOps
OVHcloud on Hugging Face Inference Providers 🔥
Hugging Face Blog·huggingface.co·about 1 month ago·MLOps
More tales about outages and numeric limits
Rachel by the Bay·rachelbythebay.com·about 1 month ago·MLOps
Import AI 435: 100k training runs; AI systems absorb human power; intelligence per watt
Import AI·importai.substack.com·about 1 month ago·MLOps
Slides-To-Translate: When IT Says No, Build a $0.04 Solution on Your Lunch Break
MLOps Community·mlops.community·2 months ago·MLOps
Import AI 429: Eval the world economy; singularity economics; and Swiss sovereign AI
Import AI·importai.substack.com·3 months ago·MLOps
Recreating the US/* time zone situation
Rachel by the Bay·rachelbythebay.com·3 months ago·MLOps
AI research is a max-performance domain
Jason Wei·jasonwei.net·7 months ago·MLOps
Some thoughts on how control over web content works
Rachel by the Bay·rachelbythebay.com·8 months ago·MLOps
Shape, Symmetries, and Structure: The Changing Role of Mathematics in Machine Learning Research
The Gradient·thegradient.pub·about 1 year ago·MLOps
Large Transformer Model Inference Optimization
Lilian Weng·lilianweng.github.io·almost 3 years ago·MLOps
Some Math behind Neural Tangent Kernel
Lilian Weng·lilianweng.github.io·over 3 years ago·MLOps
How to Train Really Large Models on Many GPUs?
Lilian Weng·lilianweng.github.io·over 4 years ago·MLOps
Mixture of Attention Schemes (MoAS): Learning to Route Between MHA, GQA, and MQA
arXiv CS.AI·arxiv.org·1 day ago·MLOps
From Pilots to Practices: A Scoping Review of GenAI-Enabled Personalization in Computer Science Education
arXiv CS.AI·arxiv.org·1 day ago·MLOps
Adversarial Training for Failure-Sensitive User Simulation in Mental Health Dialogue Optimization
arXiv CS.CL·arxiv.org·1 day ago·MLOps
Zero-Training Temporal Drift Detection for Transformer Sentiment Models: A Comprehensive Analysis on Authentic Social Media Streams
arXiv CS.LG·arxiv.org·1 day ago·MLOps
SHRP: Specialized Head Routing and Pruning for Efficient Encoder Compression
arXiv CS.LG·arxiv.org·1 day ago·MLOps
Forecasting N-Body Dynamics: A Comparative Study of Neural Ordinary Differential Equations and Universal Differential Equations
arXiv CS.LG·arxiv.org·1 day ago·MLOps
MaskOpt: A Large-Scale Mask Optimization Dataset to Advance AI in Integrated Circuit Manufacturing
arXiv CS.LG·arxiv.org·1 day ago·MLOps
Forward Only Learning for Orthogonal Neural Networks of any Depth
arXiv CS.LG·arxiv.org·1 day ago·MLOps