AI models are now everywhere, from hospitals to churches.
The astonishing thing is that even AI experts still don’t know exactly what’s happening inside these black box models, even as they’re being deployed in the highest-stakes settings imaginable.The latest strategy to figure it out: studying them like biological systems.
For example, MIT Tech Review reports, scientists at Anthropic have developed tools that let them trace what’s happening inside models as they perform a task, a type of study called mechanistic interpretability — which resembles how doctors use MRIs to study brain activity, another type of intelligence we don’t quite understand yet.
“This is very much a biological type of analysis,” Josh Batson, a research scientist at Anthropic, told Tech Review. “It’s not like math or physics.”
To read more, click here.