Visualizing Weights Author: Voss, Chelsea; Cammarata, Nick; Goh, Gabriel; Petrov, Michael; Schubert, Ludwig; Egan, Ben; Lim, Swee; Olah, Chris Publication: Distill Year: 2021
Multimodal Neurons in Artificial Neural Networks Author: Goh, Gabriel; Cammarata, Nick; Voss, Chelsea; Carter, Shan; Petrov, Michael; Schubert, Ludwig; Radford, Alec; Olah, Chris Publication: Distill Year: 2021
Visualizing and Understanding Convolutional Networks Author: Zeiler, Matthew D.; Fergus, Rob Year: 2014
Deep neural networks are easily fooled: High confidence predictions for unrecognizable images Author: Nguyen, Anh; Yosinski, Jason; Clune, Jeff
Feature Visualization Author: Olah, Chris; Mordvintsev, Alexander; Schubert, Ludwig Publication: Distill Year: 2017
The Building Blocks of Interpretability Author: Olah, Chris; Satyanarayan, Arvind; Johnson, Ian; Carter, Shan; Schubert, Ludwig; Ye, Katherine; Mordvintsev, Alexander Publication: Distill Year: 2018
Experiments in Handwriting with a Neural Network Author: Carter, Shan; Ha, David; Johnson, Ian; Olah, Chris Publication: Distill Year: 2016
Activation Atlas Author: Carter, Shan; Armstrong, Zan; Schubert, Ludwig; Johnson, Ian; Olah, Chris Publication: Distill Year: 2019
A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity Author: Longpre, Shayne; Yauney, Gregory; Reif, Emily; Lee, Katherine; Roberts, Adam; Zoph, Barret; Zhou, Denny; Wei, Jason; Robinson, Kevin; Mimno, David; Ippolito, Daphne Year: 2023
Pooling And Attention: What Are Effective Designs For LLm-Based Embedding Models? Author: Tang, Yixuan; Yang, Yi Year: 2024
Homogenization Effects of Large Language Models on Human Creative Ideation Author: Anderson, Barrett R; Shah, Jash Hemant; Kreminski, Max Year: 2024
LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations Author: Orgad, Hadas; Toker, Michael; Gekhman, Zorik; Reichart, Roi; Szpektor, Idan; Kotek, Hadas; Belinkov, Yonatan Year: 2024
"I wouldn’t say offensive but…": Disability-Centered Perspectives on Large Language Models Author: Gadiraju, Vinitha; Kane, Shaun; Dev, Sunipa; Taylor, Alex; Wang, Ding; Denton, Emily; Brewer, Robin Year: 2023
Having Beer after Prayer? Measuring Cultural Bias in Large Language Models Author: Naous, Tarek; Ryan, Michael J.; Ritter, Alan; Xu, Wei Year: 2024
An Overview of Catastrophic AI Risks Author: Hendrycks, Dan; Mazeika, Mantas; Woodside, Thomas Year: 2023
AI's Economic Peril Author: Bell, Stephanie A.; Korinek, Anton Publication: Journal of Democracy Year: 2023