Comprehensive List of Solr Fields & Operators
This is an extensive list of methods of querying the ADS system. It is a technical document and you probably don’t need to read it unless you are interested in performing advanced searches.
Solr (Virtual) Fields, Operators, and Other Stuff
An aggregated list of the fields, operators, and other parameters that are accessible from Solr. Descriptions of what they are used for, and why or where they should or should not be shown to users.
Field Name | Deprecated | Searchable | Viewable | Explanation |
---|---|---|---|---|
abstract | y | y | the abstract of the record | |
ack | y | y | Contains acknowledgements extracted from fulltexts (if identified in article). | |
aff | y | y | List of provided affiliations in a given paper | |
aff_id | y | n | List of curated affiliation IDs in a given paper | |
alternate_bibcode | y | y | List of alternate bibcodes for that document | |
alternate_title | y | y | Alternate title, usually when the original title is not in English | |
arxiv_class | y | y | Which arXiv class was the paper submitted to | |
author | y | y | List of authors on a paper | |
author_count | y | y | Number of authors on a paper | |
author_facet | y | n | Contains list of names with the number of occurences that author has for the search (can only be searched) | |
author_facet_hier | y | n | Hierarchical facet field which contains author/author_id. | |
author_norm | y | y | List of authors with their first names shortened? | |
bibcode | y | y | ADS identifier of a paper | |
bibgroup | y | y | Bibliographic group that the bibcode belongs to (curated by staff outside of ADS) | |
bibgroup_facet | y | y | Contains list of groups with the number of occurrences that group has for the search | |
bibstem | y | y | the abbreviated name of the journal or publication, e.g., ApJ. Full lists of bibstems can be found here | |
bibstem_facet | y | n | Technical field, used for faceting by publication. It contains only bibstems without volumes (eg. Sci) | |
body | y | n | Contains extracted fulltext minus acknowledgements section | |
citation | y | y | List of bibcodes that cite the paper | |
citation_count | y | y | number of citations the item has received | |
cite_read_boost | y | y | Float values containing normalized (float) boost factors. These can be used with functional queries to modify ranking of results. | |
classic_factor | y | y | Integer values containing the boost factor used by ADS Classic. In essence log(1 + cites + norm_reads) where number of citations has been normalized and the whole value is multiplied by 5000 and then cast to Integer. | |
comment | y | tbd | This is currently indexed, but not stored. To see the values, Solr needs to also store it. | |
copyright | y | y | Copyright given by the publisher | |
data | y | y | List of sources that have data related to this bibcode | |
data_facet | y | n | Contains list of data with the number of occurrences that data has for the search | |
database | y | y | Database that the paper resides in (astronomy or physics) | |
date | y | y | Same as pubdate, but of time format and used for indexing; too excessive for a user to care about/use | |
doctype | y | y | Type of document: article, thesis, etc, these stem from classic | |
doctype_facet_hier | y | n | Hierarchical facets consisting of nested document types | |
doi | y | y | Digital object identifier | |
eid | y | y | electronic id of the paper (equivalent of page number) | |
y | n | List of e-mails for the authors that included them in the article; can be viewed only with elevated privileges | ||
entdate | y | y | Creation date of ADS record in user-friendly format (YYYY-MM-DD) | |
entry_date | y | y | Creation date of ADS record in RFC 3339 (machine-readable) format | |
esources | y | y | Types of electronic sources available for a record (e.g. pub_html, eprint_pdf) | |
facility | y | tbd | List of facilities declared in paper (controlled list for AAS journals so far). Needs to be added in ADSImportpipeline and Solr schemas | |
first_author | y | y | First author of the paper | |
first_author_facet_hier | y | n | Contains list of first names with the number of occurrences that author has for the search, contains author/author_id | |
first_author_norm | y | y | n | First author of the paper with their first name shortened? |
grant | y | y | Field that contains both grant ids and grant agencies. | |
grant_agencies | y | y | y | Index with just the grant agencies names (e.g. NASA). |
grant_facet_hier | y | n | Hierarchical facet field which contains grant/grant_id. This field is not suitable for user queries, but rather for UI components. Term frequencies and positions are deactivated. | |
grant_id | y | y | y | Index with just the grant ids (e.g. 0618398) |
id | y | y | a unique integer for this record. Generally not useful, but used by Andy Casey’s python client | |
identifier | y | n | Abstract field that can be used to search an array of alternative identifiers for the record. May contain alternative bibcodes, DOIs and/or arxiv ids. | |
ids_data | ? | ? | ? | https://github.com/adsabs/issues/issues/73 |
indexstamp | y | y | Date at which the document was indexed by Solr YYYY-MM-DD’T’hh:mm:ss.SSS’Z’ | |
inst | y | n | List of curated affiliations (institutions) in a given paper | |
isbn | y | tbd | ISBN of the publication (this applies to books) | |
issn | y | tbd | ISSN of the publication (applies to journals - ie. periodical publications) | |
issue | y | y | Issue number of the journal that includes the article | |
keyword | y | y | an array of normalized and non-normalized keyword values associated with the record | |
keyword_facet | y | n | Contains list of keywords with the number of occurrences that keyword has for the search | |
keyword_norm | y | y | Controlled keywords, each entry will have a corresponding keyword_schema entry | |
keyword_schema | y | y | Schema for each controlled keyword, i.e., what was the origin of the keyword? | |
lang | y | y | In ADS this field contains a language of the main title. Currently, this value is present in a very small portion of records (try searching for lang:*) | |
links_data | y | y | y | We use it to contain info on what readable linked documents are available |
nedid | y | y | List of NED IDs within a record | |
nedtype | y | y | Keywords used to describe the NED type (e.g. galaxy, star) | |
nedtype_object_facet_hier | y | n | Hierarchical facet consisting of NED object type and ID | |
orcid_other | y | y | ORCID claims from users who used the ADS claiming interface, but did not give us consent to show their profiles | |
orcid_pub | y | y | ORCID IDs supplied by publishers | |
orcid_user | y | y | ORCID claims from users who gave ADS consent to expose their public profiles. | |
page | y | y | First page of a record | |
page_count | y | y | If page_range is present, gives the difference between the first and last page numbers in the range | |
page_range | n | n | Range of page numbers covered by the record | |
property | y | y | an array of miscellaneous flags associated with the record. Possible values include: ARTICLE, REFEREED, NOT_REFEREED, INPROCEEDINGS, OPENACCESS, NONARTICLE, EPRINT, BOOK, PROCEEDINGS, CATALOG, SOFTWARE | |
pub | y | y | the canonical name of the publication the record appeared in | |
pub_raw | y | y | Name of publisher, but also includes the volume, page, and issue if exists | |
pubdate | y | y | publication date in the form YYYY-MM-DD (DD value will always be “00”) - May be interesting for people who want publication month | |
pubnote | n | y | Comments submitted with the arXiv version of the paper | |
read_count | y | y | number of times the record has been viewed within in a 90-day windows (ads and arxiv); len(readers) | |
reader | y | n | List of identifiers for people who have read the article | |
recid | y | y | Unique identifier of the document, Integer version of ‘id’ - this is more efficient for sorting, and range queries | |
reference | y | y | List of references inside a paper | |
simbad_object_facet_hier | y | y | The hierarchical facets consisting of object_type/object_id | |
simbid | y | y | List of SIMBAD IDs within the paper. This has privacy restrictions | |
simbtype | y | y | Keywords used to describe the SIMBAD type | |
thesis | ? | ? | ? | https://github.com/adsabs/issues/issues/72 |
title | y | y | the title of the record | |
vizier | y | y | Keywords, “subject” tags from VizieR | |
vizier_facet | y | y | Contains list of VizieR keywords with the number of occurences that keyword has for the search | |
volume | y | y | Volume of the journal that the article exists in | |
year | y | y | Year of publication | |
Virtual Fields/Operators | ||||
abs | y | n | Combination of the following: abstract, title, keyword | |
all | n | n | this combines the following: author_norm, alternate_title, bibcode, doi, identifier | |
arxiv | y | n | query parser token | |
citations() | n | y | y | returns list of citations from given papers; use [citations] to get the field contents |
citis() | y | y | - | Different implementation of citation(), less memory but slower |
classic_relevance() | n | n | - | Toy-implementation of the ADS Classic relevance score algorithm. You can wrap any query and obtain the hits sorted in the ADS Classic ways (sort of) |
full | y | n | this combines the following: title^2, abstract^2, body, keyword, ack | |
instructive() | y | y | n | Synonym of reviews() |
joincitations() | y | y | - | Equivalent of citations() but implemented using lucene block-join |
joinreferences() | y | y | - | Equivalent of references() but implemented using lucene block-join |
orcid | y | n | this combined the following: orcid_pub, orcid_user, orcid_other | |
pos() | n | y | - | The pos() operator allows you to search for an item within a field by specifying the position in the field. The syntax for this operator is pos(fieldedquery,position,[endposition]). If no endposition is given, then it is assumed to be endposition = position, otherwise this performs a query within the range [position, endposition]. |
references() | n | y | y | returns list of references from given papers |
reviews() | n | y | - | returns the list of documents citing the most relevant papers on the topic being researched; these are papers containing the most extensive reviews of the field. |
reviews2() | y | - | n | Original implementation of reviews |
similar() | n | y | - | Return similar documents, based on the similarity of the abstract text |
topn() | n | y | - | Return the top N number of documents |
trending() | n | y | - | Trending – returns the list of documents most read by users who read recent papers on the topic being researched; these are papers currently being read by people interested in this field. |
useful() | n | y | - | Useful – returns the list of documents frequently cited by the most relevant papers on the topic being researched; these are studies which discuss methods and techniques useful to conduct research in this field. |
useful2() | y | y | - | What experts are citing; original implementation of useful() |
Field Name | Deprecated | Useful for user | Show/Give access? [y/n] |