Social media has some very real toxicity challenges, like harassment and cyberbullying, that diminish its utility for collective and democratic efforts, and I study human-AI content moderation to address those challenges (NSF Grant, Belfer Fellowship, Mozilla Grant, Nayar Prize). I also study data curation, especially how we evaluate (a) the impacts of data reuse and (b) investments in curating and disseminating research data. My work has been funded by the National Science Foundation, the Institute for Museum and Library Services, the Nayar Prize, Mozilla, the Anti-Defamation League, Amazon, and DiscoverText.
I have a few titles at the University of Michigan:
- Associate Professor, School of Information
- Research Associate Professor, Institute for Social Research
Associate Professor, Digital Studies Institute, College of Literature, Science, and the Arts
- Director, Resource Center for Minority Data, ICPSR
- Director, Social Media Archive, ICPSR
- Associate Director, Center for Social Media Research
Work with Me
Thank you for your interest! I’m recruiting postdocs through the Michigan Data Science Fellows program:
The Michigan Data Science Fellows program is now accepting applications for the 2023 cohort. I’d be willing to mentor someone interested in using computational approaches to understand social media experiences of historically marginalized populations. These projects would fit the Enhancing Scientific and Societal Impact or Measuring and Improving Society themes for the program. For instance, projects in this space grow from seeds like: what’s the difference in viewership and promotion for fashion YouTubers of different races? What rates of content deletion do women and nonbinary folks face compared to men on Reddit? How do the demographics of the labelers influence supervised machine learning models for toxicty detection?
Candidates and mentors must apply jointly by January 15. Email me by December 31 if you’re interested in applying together.
Current Funded Research Projects
Get in Touch
Email is best.
Here’s a PDF of my CV and some recent publications:
Data Curation and Archiving
- Hemphill, L., Schöpke-Gonzalez, A., & Panda, A. (2022). Comparative sensitivity of social media data and their acceptable use in research. Scientific Data, 9(1), 643. doi: 10.1038/s41597-022-01773-w
- Lafia, S., Fan, L., Thomer, A.K., and Hemphill, L. (2022) Subdivisions and Crossroads: Identifying Hidden Community Structures in a Data Archive’s Citation Network. Quantitative Science Studies. doi: 10.1162/qss_a_00209.
- Thomer, A. K., Akmon, D., York, J., Tyler, A. R. B., Polasak, F., Lafia, S., Hemphill, L., & Yakel, E. (2022). The craft and coordination of data curation: complicating “workflow” views of data science. Proceedings of the ACM on Human Computer Interaction (PACM HCI). doi: 10.7302/4017.
- Lafia, S., Fan, L., & Hemphill, L. (2022) A Natural Language Processing Pipeline for Detecting Informal Data References in Academic Literature. Proceedings of the Association for Information Science and Technology (ASIS&T).
- Hemphill, L., Pienta, A., Lafia, S., Akmon, D., and Bleckley, D. (2022) How do Properties of Data, Their Curation, and Their Funding Relate to Reuse?) Journal of the Association for Information Science and Technology. doi: 10.1002/asi.24646.
- Lafia, S., Thomer, A., Bleckley, D., Akmon, D., & Hemphill, L. (2021). Leveraging Machine Learning to Detect Data Curation Activities. Proceedings of 2021 IEEE 17th International Conference on E-Science. http://arxiv.org/abs/2105.00030
- Hemphill, L., Hedstrom, M. L., & Leonard, S. H. (2020). Saving social media data: Understanding data management practices among social media researchers and their implications for archives. Journal of the Association for Information Science and Technology, 3, 34. https://doi.org/10.1002/asi.24368
Social Media, Content Moderation, and Extremism
- Schöpke-Gonzalez, A., Atreja, S., Shin, H. N., Ahmed, N., & Hemphill, L. (2022). Why do volunteer content moderators quit? Burnout, conflict, and harmful behaviors. New Media & Society. doi: 10.1177/14614448221138
- Hemphill, L. (2022) Very Fine People: What Social Media Platforms Miss About White Supremacist Speech. The Anti-Defamation League.
- Balasubramanian, S.K., Bilgic, M., Culotta, A., Hemphill, L., Nikolich, A., Shapiro, M.A., (2022) Leaders or Followers? A Temporal Analysis of Tweets from IRA Trolls. 16th International AAAI Conference on Web and Social Media (ICWSM 2022).
- Hemphill, L., Million, A. J., & Erickson, I. (2021). How nonprofits use Facebook to craft infrastructure. First Monday. https://doi.org/10.5210/fm.v26i3.10265
- Im, J., Chandrasekharan, E., Sargent, J., Lighthammer, P., Denby, T., Bhargava, A., Hemphill, L., Jurgens, D., & Gilbert, E. (2020). Still out there: Modeling and Identifying Russian Troll Accounts on Twitter. 12th ACM Conference on Web Science, 1–10. https://doi.org/10.1145/3394231.3397889 (* Best Paper Runner-up)
- Jurgens, D., Chandrasekharan, E., and Hemphill, L. (2019) A Just and Comprehensive Strategy for Using NLP to Address Online Abuse. Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). Florence, Italy.
- Liu, P., Guberman, J., Hemphill, L., & Culotta, A. (2018). Forecasting the presence and intensity of hostility on Instagram using linguistic and social features. In Proceedings of the 12th International Conference on Web and Social Media. Stanford, CA, USA.
- Hemphill, L., Russell, A., & Schöpke-Gonzalez, A. M. (2020). What Drives U.S. Congressional Members’ Policy Attention on Twitter? Policy & Internet. https://doi.org/10.1002/poi3.245
- Hemphill, L., and Schöpke-Gonzalez, A.M. (2020) Two Computational Models for Analyzing Political Attention in Social Media. International AAAI Conference on Web and Social Media (ICWSM 2020), 260-271.
- Hemphill, L. and Shapiro, M.A. (2019) Appealing to the Base or to the Moveable Middle? Incumbents’ Partisan Messaging Before the 2016 U.S. Congressional Elections. Journal of Information Technology and Politics. doi: 10.1080/19331681.2019.1651685
- Shapiro, M.A. and Hemphill, L. (2017). Politicians and the Policy Agenda: Does Use of Twitter by the U.S. Congress Direct New York Times Content? Policy and Internet . doi: 10.1002/poi3.120
- Hemphill, L., Culotta, A., & Heston, M. (2016). #Polar Scores: Measuring Partisanship Using Social Media Content. Journal of Information Technology & Politics. 13(4). doi: 10.1080/19331681.2016.1214093
Some of my work that’s not (yet) published. These are workshop papers, papers under review, and/or longer versions of papers for conferences that review only abstracts during submission.
- Atreja, S., Hemphill, L., Resnick, P. (2022) What is the Will of the People? Moderation Preferences for Misinformation
- Lafia, S., Ko, J-W., Moss, E., Kim, J., Thomer, A., Hemphill, L. (2021) Detecting Informal Data References in Academic Literature.
- Hemphill, L., Leonard, S.H., and Hedstrom, M. (2018) Developing a Social Media Archive at ICPSR
- Hemphill, L. (2018) More Specificity, More Attention to Social Context: Reframing How We Address ‘‘Bad Actors’’
- Hemphill, L., Culotta, A., and Heston, M. (2013) Framing in Social Media: How the US Congress Uses Twitter Hashtags to Frame Political Issues
You can find more at
- Deep Blue (Michigan’s institutional repository)
- arXiv.org (open access e-prints)
- SSRN (though I won’t post new papers here now that Elsevier owns it)