话题精选
读书
旅行
好物
极客
个人总结
所有博客
Josherich
home sweet home
访问博客
Stanford CS336 Language Modeling from Scratch - Spring 2025 - Mixture of experts
Jeff Dean’s talk at ETH Zurich in April 2025 on important trends in AI
Model Architecture Design for Modern Hardware with Tri Dao
#135. DeepSeek 股权架构分析 - 揭秘AI独角兽的资本迷宫
Chips: Liberated? Trump’s Semis Tariff Gambit
Stanford CS336: Language Modeling from Scratch | Spring 2025 | Architectures, Hyperparameters
Breaking Huawei + Tariffs Done Right with SemiAnalysis and Asianometry
#40 - Lewis Bollard - How to End Factory Farming
Sinica Live at Columbia University, with Yawei Liu and Yukon Zhang
A Toy Manufacturer Explains How Trump’s Tariffs Could Crush His Industry
Cancer Detection (The Derby Mill Series ep 09)
How does LMQL work and Why is it dead?
How our time online is changing (ft. CEO of Microsoft AI, Mustafa Suleyman)
Animating the Steinhaus-Johnson-Trotter Permutation Generation Algorithm
398 「赤脚医生」兴衰史:大时代中的医疗下乡运动
94.《夜访吸血鬼》:爱与永生,是比死亡更深的诅咒
South China Sea Update: Will the U.S. Really Defend the Philippines Against China?
SF Compute: Commoditizing Compute
Incels, Evo Psych, and Modern Literature with ARX-Han — #83
在创作中把你困住的是什么?重轻与汉洋仔细掰扯创作
EMERGENCY POD: Liberation Day with Tanner Greer of Scholarstage
Lambda Days 2015 - Torben Hoffmann - Thinking like an Erlanger
Liberation Day, Tariffs, US v China Open Source, OpenAI Fundraise, $CRWV, TikTok | BG2 w/ Bill Gurley & Brad Gerstner
#39 - Daniel Kokotajlo - Wargames, Superintelligence & Quitting OpenAI
The Road to Zig 1.0 - Andrew Kelley
Why does it take so long to write to an array with one billion elements in Python?
Alan Kay, 2015: Power of Simplicity
Andrew Kelley Practical Data Oriented Design (DoD)
Lessons From Southeast Asia on How to Manage Great Power Rivalries
The 1000x faster financial database (Interview)
The Only Unbreakable Law
Building Manus AI (first ever Manus Meetup)
Shortwave Rides the Tidal Wave: Inbox Agents, Hyper-Growth & Hiring AI Managers, with CEO Andrew Lee
Slavoj Zizek: The Reality of the Virtual
The Agent Network — Dharmesh Shah
The Last 2 Months — and Next 2 Years — of U.S. Politics
What does Palantir actually do?
【哔说】8年只做游戏纪录片 导演BK的钱从哪来
Callum Williams: Economics, AI, and Technological Progress — #82
Code Context is King: Augment’s AI Assistant for Professional Software Engineers, with Guy Gur-Ari
And, This is Ezra Klein This is Gavin Newsom
395 辛德勇谈海昏侯墓背后的西汉废立往事
GRPO’s new variants and implementation secrets
a16z on AI Voices: Call Centers, Coaches, and Companions with Olivia Moore & Anish Acharya
Scott Bessent | All-In in DC!
393 在巴黎教会档案里发现一座中国东北村庄
Building Compute in America
与马毅聊智能史:“DNA是最早的大模型”,智能的本质是减熵
392 国际援助遭遇真空时刻:从USAID停摆说起
CMU Advanced NLP Spring 2025 (15): Quantization (Guest: Tim Dettmers)
与Haivivi李勇聊月入千万的AI Jellycat:小众AI硬件×大众消费品的交叉口创业
Why Trump’s Tariffs Won’t Work
Live in Berkeley: Jessica Chen Weiss and Ryan Hass on the U.S. and China in 2025
Text Completion Fine-tuning from PDF Files in Colab
十字路口的德国:大选之后,政治版图如何重塑
390 「假肉驱逐真肉」的背后:牛肉产业全球化挤压下的游牧饮食
Deep Learning Day: Generative Modeling
I Raised $300M To Bring AI To Laywers | Winston Weinberg & Harvey
AMD RX9070XT显卡评测:暴打50系,A卡支棱起来了!
Stanford CS224N: NLP with Deep Learning | Spring 2024 | Lecture 15 - After DPO by Nathan Lambert
Carl Zimmer on the Hidden Life in the Air We Breathe
Daniel Spielman “Miracles of Algebraic Graph Theory”
Making A Browser Is Harder Than You Think (Ft Andreas Kling)
Dramatically improve microscope resolution with an LED array and Fourier Ptychography
Driving Down The Cost of Next-Generation CPUs
Paper walkthrough: rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
Epstein Files Flop, State of the Market, Autonomous Robots, Trump’s Gold Card, Friedberg on Jeopardy
The Ultra Scale Playbook
The Dark Heart of Trump’s Foreign Policy
Reformat Long Tweets
Building scalable systems for automatically understanding LLMs
Open Operator, Serverless Browsers and the Future of Computer-Using Agents
The End of Reading
When Nanoseconds Matter: Ultrafast Trading Systems in C++ - David Gross - CppCon 2024
C++ Data Structures That Make Video Games Go Round - Al-Afiq Yeong - CppCon 2024
387 世界大战、铁幕外宣与广播攻势:从马斯克威胁关停VOA与RFE谈起
HC2024-S7: High-Performance Processors Part 2
Weak-to-Strong Generalization
Terence Tao - Machine-Assisted Proofs (February 19, 2025)
Deep dive on going from Vue to Htmx in a large-scale production app
Sam Bankman-Fried Speaks To The New York Sun From Prison
午后偏见038|听施小炜谈村上春树和他的时代
The Stablecoin Future, Milei’s Memecoin, DOGE for the DoD, Grok 3, Why Stripe Stays Private
Deep dive on going from Vue to Htmx in a large-scale production app
Sam Bankman-Fried Speaks To The New York Sun From Prison
DPDK in Databases: Why Isn’t It More Common? - Owen Hilyard, University of New Hampshire
Inference Scaling, Alignment Faking, Deal Making? Frontier Research with Ryan Greenblatt of Redwood
DPDK in Databases: Why Isn’t It More Common? - Owen Hilyard, University of New Hampshire
Inference Scaling, Alignment Faking, Deal Making? Frontier Research with Ryan Greenblatt of Redwood
CMU Advanced NLP Spring 2025 (11): Reinforcement Learning
How DeepSeek changes the LLM story - Sasha Rush
Lecture 44: NVIDIA Profiling
Test-time Regression - Alex Wang | ASAP Seminar #01
π0: A Foundation Model for Robotics with Sergey Levine - 719
GPU Mode Lecture 32: Unsloth
GPU Mode Lecture 32: Unsloth
385 哪吒东游、莲生埃及与十九世纪火山爆发下的全球史
陈良榕:你需要知道的张忠谋与台积电
#09 - Search Parallelization: Bottom-up (CMU Optimize!)
91.《乔瓦尼的房间》:为爱活,也为爱死,虐恋怎么不是爱?!
查看更多