As research datasets and analyses grow in complexity, data that could be valuable to other researchers and to support the integrity of published work remain undiscoverable across disciplines. These data are especially concentrated in the “Long Tail” of funded research (Heidorn, 2008) where curation resources and related expertise are often inaccessible. In the domain of astronomy, it is widely acknowledged that uncurated "dark data" exist, but the scope of the problem remains uncertain. Following expert feedback obtained through two workshops held in 2015 and 2016 for the “Astrolabe” project, my PhD dissertation research aims to define the properties of uncurated or at-risk astronomical data, and to identify strategies for locating potentially-useful data to be targeted for curation. This mixed-methods project will enhance our understanding of the nature and prevalence of astronomical dark data and characterize astronomy’s Long Tail by: 1) Conducting background research through content analysis, targeted interviews, participant observation, and workshops; 2) Developing heuristics for searching the astronomical literature for references to potentially uncurated or at-risk data by identifying statistically-significant attributes of publications and authors, and through text classification; and 3) Surveying authors of a sample of journal publications about data and research practices.
University of Arizona, PhD, School of Information
Expected completion: December 2018
Dissertation Committee Members: Dr. P. Bryan Heidorn (Chair), Director, UA School of Information; Dr. Yotam Shmargad, Assistant Professor, UA School of Information; Dr. Erin Leahey, Professor, UA School of Sociology; Dr. John Mester, Associate Vice President for Research, UA Research, Discovery & Innovation
Dissertation title: "Exploring the Long Tail of Astronomy: A Mixed- Methods Approach to Searching for Dark Data through the Lens of the Scholarly Literature"
Clarion University of Pennsylvania, Master of Science, Library Science
New Mexico Institute of Mining and Technology, Bachelor of General Studies
Instructor, ESOC 211 - Collaborating in Online Communities, University of Arizona School of Information
June 2017 - present
Astrolabe Graduate Research Associate, University of Arizona School of Information
August 2015 - August 2017
Research Development Fellow, University of Arizona Office for Research & Discovery
July 2015 - July 2016
Graduate Student Assistant, American Astronomical Society
May 2015 - August 2015
Teaching Assistant, University of Arizona School of Information
August 2014 - May 2015
Graduate Student Intern, Large Synoptic Survey Telescope (LSST)
May 2014 - August 2014
Graduate Research Assistant, University of Arizona School of Information Resources and Library Science
August 2013 - May 2014
March 2012 - present
Writer and Web Editor, The Santiago Times
May 2012 - January 2013
Document Specialist, Atacama Large Millimeter/submillimeter Array (ALMA)
January 2007 - July 2012
Assistant Circulation Supervisor, New Mexico Institute of Mining and Technology Joseph R. Skeen Library
August 2001 - March 2007
Heidorn, P. B., Stahlman, G.R., & Steffen, J. (Under Review). Astrolabe: Curating, linking and computing astronomy’s dark data. Astrophysical Journal Supplement Series.
Brooks, C. F., Heidorn, P. B., Stahlman, G. R., and Chong, S. S. (2016). Working beyond the confines of academic discipline to resolve a real-world problem: A community of scientists discussing long-tail data in the cloud. First Monday.
CONFERENCE PROCEEDINGS AND WORKSHOPS:
Stahlman, G.R., Heidorn, P.B., Steffen, J. (In Press). Identifying and curating astronomical “dark data” through development of cyberinfrastructure resources. Proceedings of Library and Information Services in Astronomy VIII (LISA VIII), June 7-9, 2017, Strasbourg, France.
Brooks, C. F., Heidorn, P.B., Stahlman, G., and Chong, S. (2015). Discourses, a community of scientists, and long-tail data in the cloud. Paper submitted for the annual meeting of the Association of Internet Researchers (AoIR), October 21-24, Phoenix, AZ.
Bhatia, R., et al. (2012). System engineering of the Atacama Large Millimeter/submillimeter Array. Proceedings of the International Society for Optics and Photonics, volume 8449. doi: 10.1117/12.926822.
Heidorn, P.B., Stahlman, G.R., and Chong, S. (2015). Datasphere at the Biosphere II: Computation and data in the wild. Poster presentation at iConference, Newport Beach, CA, March 25-26, 2015.
“Identifying and curating astronomical “dark data” through development of cyberinfrastructure resources", Library and Information Services in Astronomy VIII (LISA VIII), June 7-9, 2017, Strasbourg, France.
“Exploring the 'Long Tail' of Astronomy Data through Cyberinfrastructure Development”, Invited Colloquia Talk, Center for Digital Society and Data Studies / University of Arizona School of Information, Tucson, AZ, February 3, 2017
“Navigating Astronomical Dark Data through Advanced Cyberinfrastructure”, American Astronomical Society Author-Referee Workshops at Division for Planetary Sciences Annual Meeting and California Institute of Technology, Pasadena, CA, October 2016
AWARDS AND HONORS:
PhD Student Travel Grant ($800) awarded by the University of Arizona School of Information, June 2017
Research Assistantship funded by the Astrolabe Project, through UA Accelerate for Success grant, and NSF SI2-SSE award #1642446, “Visualizing Astronomy Repository Data using WorldWide Telescope Software Systems”, 2016-2017 (participated in writing proposals)
Full scholarship and travel support awarded by National Optical Astronomy Observatory to attend the La Serena School of Data Science in La Serena, Chile, August 2015
Research Development Fellowship awarded by University of Arizona Office for Research & Discovery, 2015-2016 academic year
Graduate and Professional Student Council (GPSC) Research and Project Grant awarded for purchase of equipment ($1065.00), December 2014