6/12/2025
yes im very late
this article argues about how mech interp is not that useful and it raises a lot of good points (i.e. in biology / other complex systems, we don't do the bottom up approach, and a lot of the buzz from mech interp comes from cherrypicked results, the compression involved leads to the loss of edge cases, so much has been invested but not much has come out, this post talks about how google stopped prioritizing SAEs because they weren't performing as well). i think i should definitely keep this in mind as i get more interested in interpretability; i've heard friends make such comments too. meanwhile this article by dario amodei urges for the importance of mech interp, tho he is from anthropic which may influence his viewpoints. he advocates for the govt to also get involved, citing all the bad disasters that may happen because ai is too "opaque". i get his points too; the race between the development of ai models and our understanding of them is pretty concerning. anyways, i think i'll still be exploring interpretability which the first article acknowledges is more interesting to those coming from a math background who enjoy rigor and proofs. i'll just keep both sides in mind, and the reference to focusing on top to bottom approaches is valid, with developments in ai safety like representation engineering etc. here's a really neat resource with lots of resources for ai safety - https://www.aisafety.com/map
cool application of equidistributed sequences to isl problem
tessellation thm - Wallace-Bolyai-Gerwin theorem
Comments
Post a Comment