Announcing the NASA Science Explorer
Alberto Accomazzi (ADS Principal Investigator)
19 Apr 2023
Did you know that 2023 is the year of Open Science in the United States, and that NASA is leading the effort with its Open-Source Science Initiative (OSSI)? The initiative aims to make publicly funded scientific research transparent, inclusive, accessible, and reproducible. The NASA Astrophysics Data System (ADS) is participating in the effort by extending its discovery platform to all research funded by the NASA Science Mission Directorate (SMD) divisions. The ADS was selected to lead this effort based on its decade-long, groundbreaking efforts in supporting the goals of Open Science for the Astronomy and Astrophysics community.
An initial expansion of the ADS scope took place in 2021, when NASA asked the project to incorporate research content published in Heliophysics and Planetary Science, two SMD disciplines closely related to Astrophysics. In 2022, the agency approved an additional proposal for an expansion into the two remaining SMD disciplines (Earth Science and Biological and Physical Sciences).
The expansion plan, which has now begun in earnest, involves the creation of a literature-based, open digital information system covering and unifying the fields of Astrophysics, Planetary Science, Heliophysics, and Earth Science. It will also cover NASA funded research in Biological and Physical Sciences. Codenamed NASA Science Explorer, or SciX for short, it will extend ADS to become a permanent major component in the infrastructure of scientific research, providing important contributions towards the goal of open science.
Role and impact of the
NASA Astrophysics Data System
For nearly 30 years the NASA Astrophysics Data System has been a core component in the intellectual infrastructure of astrophysics; for more than two decades it has dominated the search and discovery process for the astronomical research literature. It is used daily by nearly every research astronomer, worldwide. In keeping with NASA’s long term commitment to open science, the ADS services have always been open and freely accessible to everyone on Earth. By integrating and linking to hundreds of data sources, such as publishers, archives, data centers, and libraries, the ADS team has created a powerful international, multi-partner research discovery engine, unified under a single digital library system.
Having an open, sophisticated digital library has had an enormous impact on astronomy research. Kurtz et al (2005) used the ADS logs from 2002 and estimated that use of the ADS increased the efficiency of the astronomical research process by an amount equivalent to 6% of its total research budget. In 2021, 6% of all articles in the major astronomy journals explicitly acknowledged the ADS, an amount similar to the papers acknowledging use of or funding from the Hubble Space Telescope, NASA’s most successful Astrophysics Mission to date (see figure 1).
Figure 1. Acknowledgments found in the scholarly literature to ADS, and NASA’s most prominent astrophysics missions: the Hubble Space Telescope (HST), and the Chandra X-ray Telescope. HST and Chandra numbers include required acknowledgments of research grant funding.
Distinguishing features of ADS are its authoritative and unique coverage of the relevant literature, comprehensive citation network, efficient curation efforts, responsive user support, and integration of discipline-specific knowledge and capabilities. This allows scientists to search the ADS using familiar language and concepts, knowing that the system will properly interpret their request through e.g. discipline-specific term normalization, as well as synonym and acronym expansion.
Announcing the NASA Science Explorer
Building a new digital library to serve all five SMD disciplines represents a major endeavor with technological challenges and opportunities. The experience gained from operating the ADS for the past 30 years shows the importance of having a system providing the ability to discover all relevant content in a particular discipline. The inclusion of additional research areas and the need to support interdisciplinary research means that the system needs to be extended in a way that allows both breadth and depth, with the possibility of performing deep dives on particular research topics as well as discovering cross-disciplinary connections through citations and co-readership networks.
To accomplish this, NASA SciX will combine ADS’s current scalable, discipline-agnostic infrastructure with a set of discipline specific knowledge centers which will curate and enrich its content using the deep subject matter expertise which has been crucial to the success of the ADS. The ultimate goal of this effort is to create a digital library where:
- All discipline-specific research content is aggregated, connected, and indexed for each of the SMD divisions;
- Relevant taxonomies are used to capture the knowledge and semantics of the subject disciplines;
- Curation and machine learning-based text mining and enrichment are combined in a platform that is designed to scale without sacrificing accuracy and flexibility;
- Digital collections are enriched with links to other research objects such as datasets, software, notebooks, and funding information;
- Discipline-specific capabilities and analytic services are exposed to the relevant research communities;
- Discoverability and access to NASA-funded research artifacts and derived data products are available to all from a public search portal;
- New and existing initiatives are developed and supported in collaboration with NASA and other research organizations.
The initial construction phase of NASA SciX will be completed in three years, by which time the system will be fully operational for all the included disciplines. Content from the new disciplines will be ingested and new agreements necessary for its operations will have been established, including collaborations with publishers, societies, missions, archives, and other data centers. New capabilities will have been developed, new discipline-specific features will have been built, and the project will have a new, permanent home as one of NASA’s research portals.
In order to meet the goals of scaling up its data holdings and capabilities to serve the entirety of SMD-funded research, NASA SciX will leverage new developments in AI and Machine Learning to aid its curation efforts. The NASA SciX platform will exist in a state of continuous improvement, following the example of the ADS, with new features and content being added in response to the needs of the community it serves.
NASA SciX will be a resource open to the worldwide research community as well as the general public. It will feature a responsible website and an open API built on the existing ADS infrastructure. Following a spirit of transparency and community embodied by NASA’s Open Source Science initiative, all of NASA SciX’s software development efforts, training datasets, and Open Access content will be made available to the public for further re-use and development.
If you are a long-time user of the ADS, rest assured that the system that you have learned to depend on for your daily research needs is not going to disappear or become more complicated. Rather, the launch of SciX will bring new features that improve the functionality and usability of the system, while still allowing users to rely on the familiar ADS search interface. Stay tuned for more news!