Comprehensive List of Solr Fields & Operators

This is an extensive list of the fields and operators available for querying the ADS system. It is a technical document; the average ADS user probably does not need to read it.

Solr (Virtual) Fields, Operators, and Other Stuff

An aggregated list of the fields, operators, and other parameters that are accessible from Solr, with descriptions of what they are used for and why or where they should or should not be shown to users.

Field Name Deprecated Useful for user Show/Give access? [y/n] Explanation            
abstract n y y the abstract of the record            
ack n y y Contains acknowledgements extracted from fulltexts (if it was possible to identify them).            
aff n y y List of affiliations in a given paper            
alternate_bibcode n y y List of alternate bibcodes for that document            
alternate_title n y y Alternate title, usually when the original title is not in English            
arxiv_class n y y Which arXiv class was the paper submitted to            
author n y y List of authors on a paper            
author_facet n y&n y&n Contains list of names with the number of occurrences that author has for the search (can only be searched)
author_facet_hier n n n Hierarchical facet field which contains author/author_id.
author_norm n n n List of authors with their first names shortened?            
bibcode n y y ADS identifier of a paper            
bibgroup n y y Bibliographic group that the bibcode belongs to (maintained by librarians of the relevant group)            
bibgroup_facet n y&n y&n Contains list of groups with the number of occurrences that group has for the search
bibstem n y y the abbreviated name of the journal or publication, e.g., ApJ.            
bibstem_facet n n n Technical field, used for faceting by publication. It contains only bibstems without volumes (e.g., Sci)
body n y n Contains extracted fulltext minus acknowledgements section            
citation n y n List of bibcodes that cite the paper            
citation_count n y y number of citations the item has received            
cite_read_boost n n n Float values containing normalized (float) boost factors. These can be used with functional queries to modify ranking of results.            
classic_factor n n n Integer values containing the boost factor used by ADS Classic. In essence log(1 + cites + norm_reads) where number of citations has been normalized and the whole value is multiplied by 5000 and then cast to Integer.            
comment n y tbd This is currently indexed, but not stored. To see the values, Solr needs to also store it.
copyright n y y Copyright given by the publisher            
data n y y List of sources that have data related to this bibcode            
data_facet n y&n y&n Contains list of data with the number of occurrences that data has for the search
database n y y Databases that the paper resides in            
date n n n Same as pubdate, but of time format and used for indexing; too excessive for a user to care about/use            
doctype n n n Type of document (article, thesis, etc.); these values stem from ADS Classic
doi n y y Digital object identifier            
eid n y y electronic id of the paper (equivalent of page)            
email n y n List of e-mails for the authors that included them in the article; protect due to privacy concerns - difficult to stop unless we stop people seeing it now            
facility n y tbd List of facilities declared in paper (controlled list for AAS journals so far). Needs to be added in ADSImportpipeline and Solr schemas            
first_author n y y First author of the paper            
first_author_facet_hier n y&n y&n Contains list of first-author names with the number of occurrences that author has for the search; contains author/author_id
first_author_norm n n n First author of the paper with their first name shortened?            
grant n y y Field that contains both grant ids and grant agencies.            
grant_agencies y n n Index with just the grant agencies names (e.g. NASA).            
grant_facet_hier n n n Hierarchical facet field which contains grant/grant_id. This field is not suitable for user queries, but rather for UI components. Term frequencies and positions are deactivated.
grant_id n n n Index with just the grant ids (e.g. 0618398)            
id n n y a unique integer for this record. Generally not useful, but used by Andy Casey’s python client            
identifier n y y an array of alternative identifiers for the record. May contain alternative bibcodes, DOIs and/or arxiv ids.            
ids_data ? ? ? https://github.com/adsabs/issues/issues/73            
indexstamp n y y Date at which the document was indexed by Solr YYYY-MM-DD’T’hh:mm:ss.SSS’Z’            
isbn n y tbd ISBN of the publication (this applies to books)            
issn n y tbd ISSN of the publication (applies to journals - ie. periodical publications)            
issue n y y Issue of the journal that the article belongs to            
keyword n y y an array of normalized and un-normalized keyword values associated with the record            
keyword_facet n y&n y&n Contains list of keywords with the number of occurrences that keyword has for the search
keyword_norm n n n Controlled keywords, each entry will have a corresponding keyword_schema entry            
keyword_schema n y y Schema for each controlled keyword, i.e., what was the origin of the keyword?            
lang n y y In ADS this field contains the language of the main title. Currently, this value is present in a very small portion of records (try searching for lang:*)
links_data n n n We use it to contain info on what readable linked documents are available? - If true, I think unimportant for users            
orcid_other n y y ORCID claims from users who used Bumblebee claiming interface, but did not give us consent to show their profiles            
orcid_pub n y y ORCID IDs supplied by publishers            
orcid_user n y y ORCID claims from users who gave ADS consent to expose their public profiles.            
page n y y String value (possibly two) indicating page range.            
property n y y an array of miscellaneous flags associated with the record. Possible values include: ARTICLE, REFEREED, NOT_REFEREED, INPROCEEDINGS, OPENACCESS, NONARTICLE, EPRINT, BOOK, PROCEEDINGS, CATALOG, SOFTWARE            
pub n y y the canonical name of the publication the record appeared in            
pub_raw n n n Name of publisher, but also includes the volume, page, and issue if exists            
pubdate n y y publication date in the form YYYY-MM-DD (DD value will always be “00”) - May be interesting for people who want publication month            
read_count n y y number of times the record has been viewed within a 90-day window (ads and arxiv); len(readers)
reader n y n List of identifiers for people who have read the article            
recid n n n Unique identifier of the document, Integer version of ‘id’ - this is more efficient for sorting, and range queries            
reference n y n List of references inside a paper            
simbad_object_facet_hier n y&n y&n The hierarchical facets consisting of object_type/object_id            
simbid n y y List of SIMBAD IDs within the paper. This has privacy restrictions            
simbtype n y y Keywords used to describe the SIMBAD type            
thesis ? ? ? https://github.com/adsabs/issues/issues/72            
title n y y the title of the record            
vizier n y y Keywords, “subject” tags from VizieR            
vizier_facet n y&n y&n Contains list of VizieR keywords with the number of occurrences that keyword has for the search
volume n y y Volume of the journal that the article exists in            
year n y y Year of publication            
Virtual Fields/Operators                    
abs n y y Combination of the following: abstract, title, keyword            
all n n n this combines the following: author_norm, alternate_title, bibcode, doi, identifier            
arxiv n y y query parser token            
citations() n y y returns list of citations from given papers            
citis() y n n Different implementation of citations(), less memory but slower
classic_relevance() n n n Toy-implementation of the ADS Classic relevance score algorithm. You can wrap any query and obtain the hits sorted in the ADS Classic ways (sort of)            
full n y y this combines the following: title^2, abstract^2, body, keyword, ack            
instructive() y n n Synonym of reviews()            
joincitations() y n n Equivalent of citations() but implemented using lucene block-join            
joinreferences() y n n Equivalent of references() but implemented using lucene block-join            
orcid n y y this combines the following: orcid_pub, orcid_user, orcid_other
pos() n y y The pos() operator allows you to search for an item within a field by specifying the position in the field. The syntax for this operator is pos(fieldedquery,position,[endposition]). If no endposition is given, then it is assumed to be endposition = position, otherwise this performs a query within the range [position, endposition].            
references() n y y returns list of references from given papers            
reviews() n y y returns the list of documents citing the most relevant papers on the topic being researched; these are papers containing the most extensive reviews of the field.            
reviews2() y n n Original implementation of reviews
topn() n y y Return the top N number of documents            
trending() n y y Trending – returns the list of documents most read by users who read recent papers on the topic being researched; these are papers currently being read by people interested in this field.            
useful() n y y Useful – returns the list of documents frequently cited by the most relevant papers on the topic being researched; these are studies which discuss methods and techniques useful to conduct research in this field.            
useful2() y n n What experts are citing; original implementation of useful()
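
To make the operator syntax in the table more concrete, here is a minimal Python sketch that only assembles query strings; it does not contact ADS or Solr. The helper names (fielded, pos, wrap), the author name "Doe, J" and the title phrase are hypothetical placeholders chosen for illustration. The pos() form follows the syntax quoted above, pos(fieldedquery,position,[endposition]). The wrapper covers only the non-deprecated operators listed above that are described as taking a set of papers or a topic; treating each of them as a single-argument operator is an assumption. topn() is left out because its argument order is not given here.

```python
from typing import Optional

# Non-deprecated operators from the table above that wrap a query.
# Assumption: each takes exactly one query argument.
WRAPPING_OPERATORS = {"citations", "references", "reviews", "trending", "useful"}


def fielded(field: str, value: str) -> str:
    """Build a fielded phrase query such as author:"Doe, J"."""
    escaped = value.replace('"', '\\"')  # keep embedded quotes from ending the phrase
    return f'{field}:"{escaped}"'


def pos(fielded_query: str, position: int, end_position: Optional[int] = None) -> str:
    """Build pos(fieldedquery,position,[endposition]).

    When end_position is omitted it is left out of the string, since the
    table states that endposition then defaults to position.
    """
    if end_position is None:
        return f"pos({fielded_query},{position})"
    if end_position < position:
        raise ValueError("end_position must not be smaller than position")
    return f"pos({fielded_query},{position},{end_position})"


def wrap(operator: str, query: str) -> str:
    """Wrap a query in one of the operators listed in WRAPPING_OPERATORS."""
    if operator not in WRAPPING_OPERATORS:
        raise ValueError(f"unsupported operator: {operator}")
    return f"{operator}({query})"


if __name__ == "__main__":
    author = fielded("author", "Doe, J")              # hypothetical author name
    print(pos(author, 1))                             # author in the first position
    print(pos(author, 1, 2))                          # author in positions 1 to 2
    print(wrap("citations", fielded("title", "dark energy")))
```

The resulting strings are meant to be passed as the query text to whatever client is used to reach the search service. Keeping query construction in small pure functions makes the syntax easy to check without network access.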