Jeff Wu Profile
Jeff Wu

@WuTheFWasThat

603
Followers
346
Following
0
Media
7
Statuses

Joined July 2009
Don't wanna be here? Send us removal request.
@WuTheFWasThat
Jeff Wu
5 months
<3
@janleike
Jan Leike
5 months
To all OpenAI employees, I want to say: Learn to feel the AGI. Act with the gravitas appropriate for what you're building. I believe you can "ship" the cultural change that's needed. I am counting on you. The world is counting on you. :openai-heart:
241
412
5K
3
5
90
@WuTheFWasThat
Jeff Wu
10 months
@StephenLCasper see Figure 13 :)
0
0
3
@WuTheFWasThat
Jeff Wu
10 months
@StephenLCasper See 5.1.2 + 5.1.3. In a pure denoising regime, you would see student-gt agreement>student-supervisor agreement, and positive student-supervisor agreement scaling with student compute. So we're at least partially in debiasing regime. We probably should've discussed this!
0
0
4
@WuTheFWasThat
Jeff Wu
7 months
@wesg52 Unsurprising I think. SAE errors will be correlated with directions relevant to loss, since the activation itself is optimized for loss. Only a very good SAE's errors would match noise - it'd have to hit the target with low precision rather than miss b/c of flaws in modeling
0
0
6