PHD Discussions Logo

Ask, Learn and Accelerate in your PhD Research

Question Icon Post Your Answer

Question Icon

What are the ethical boundaries when using publicly available social media data for research?

My project involves analyzing tweets. The data is public, but the users didn't consent to be in my study. Do I need IRB approval? How do I ethically handle quoting tweets or protecting users' identities?

All Answers (1 Answers In All)

By Olu Answered 1 year ago

"Public" does not mean "ethically free to use." The core principles are minimizing harm and respecting context. You likely need IRB approval—many universities require it for any human subjects data, regardless of source. Key practices: 1) Anonymize aggressively: remove usernames, user IDs, and any identifying information. Don't quote tweets verbatim if a search could reveal the user. 2) Consider sensitivity: Research on mental health or political dissent using public posts carries higher risk. 3) Check platform Terms of Service—some prohibit scraping. 4) Practice data minimization: only collect what you need. The field is evolving, but the ethical burden is on the researcher to not treat people as mere data points. When in doubt, consult your IRB and follow the Association of Internet Researchers' guidelines.

Your Answer