Data export
Learn the details behind all the variables Unsub uses to forecast your costs and fulfillment
The data fields in the Download as Spreadsheet are described here:
Metadata
issn_l_prefixed
The primary ISSN we use for this journal (see issn.org Linking ISSNs) prefixed with "issn:"
issn_l
The primary ISSN we use for this journal (see issn.org Linking ISSNs)
title
The title of this journal.
issns
A JSON array of ISSNs for this journal
publisher
The name of the publisher of the journal. "null" if we do not know the name of the publisher.
OpenAlex uses the term "concept" - which is equivalent to "subject" here
subject
A single OpenAlex concept (equivalent to subject in Unsub). The concept is selected by filtering to level zero OpenAlex concepts, then selecting those with the maximum score. If there's more than one concept we then select one alphabetically, starting from "A". You may also want to look at subject_top_three due to this alphabetical selection. See OpenAlex docs for more information.
subject_top_three
The top three OpenAlex concepts. Selected using the top three level zero OpenAlex concepts based on their score for strength of association between the journal and the concept. See OpenAlex docs for more information.
subjects_all
All level zero OpenAlex concepts and their IDs in a JSON array. With the IDs one can look up more information on each concept by appending the ID to https://openalex.org
(e.g., https://openalex.org/C185592680). See OpenAlex docs for more information.
Summary
subscribed
True
if you currently have the journal modeled as "subscribed" in your scenario, False
if the journal is currently "unsubscribed"
is_society_journal
True
if the journal is a society journal; False
if not.
usage
The projected Usage of the journal by your institution. Usage is a measure of the journal's value to your institution. It is calculated as:
Usage of a journal =
Downloads from the journal
+ (Citations to the journal by your authors) * (citation weight)
+ (Authored papers in the journal) * (authorship weight)
cpu
This is the Cost per Use (CPU) of the journal. We use a more advanced variant of CPU (internally we call this Net Per Paid Use, or NCPPU). This CPU calculation is the net cost (subscription minus ILL), divided by Paid Use (Usage that can't be met with free sources). This is our most important measure of journal value--it summarizes the real value you're getting for your subscription dollar. This article has more details: How do we calculate cost-effectiveness (Cost Per Use, CPU)
cpu_rank
The journal's rank according to CPU, relative to other journals in this dataset. The most cost-effective journal has a rank of 1.
cost
The cost of this journal according to the Settings and Subscription status of the journals in this Scenario. It will be either Subscription Cost or ILL Cost (see next columns), depending on whether the journal is currently subscribed to or not in this scenario.
instant_usage_percent
The percent of Usage that is available instantly -- available as Open Access, PTA, or Subscription.
free_instant_usage_percent
The percent of Usage that can be fulfilled via Open Access or PTA. This excludes usage met only by ILL or Subscription.
Costs
subscription_cost
The title-by-title subscription cost of this journal. It includes a 'content fee' based on your settings, and annual price increases over the next five years (the cost shown is the average cost over the next five years). This data is based on public price lists unless you've uploaded a custom price list (available in a purchased account). This article has more details: How do we calculate title-by-title subscription costs?
ill_cost
The ILL subscription cost of this journal. It is based on the ILL transaction cost and your ILL Request Rate, both of which are adjustable parameters. This article has more details: How do we calculate ILL requests and ILL cost?
subscription_minus_ill_cost
The net cost of the subscription, which is the Subscription cost above minus the ILL cost (since almost all universities will offer a journal by ILL if they didn't subscribe). For some journals with very high Usage and low subscription prices this number is actually negative -- subscribing is *cheaper* than filling all the anticipated ILL requests.
Fulfillment
use_oa_percent
The percent of Usage that can be fulfilled via as Green, Hybrid, or Bronze Open Access. Some of this content can be excluded in the settings (see the Open Access page for more info).
use_backfile_percent
The percent of Usage that can be fulfilled via your PTA (Post-Termination Access) rights.
use_subscription_percent
Percent of Usage that can be fulfilled ONLY via title-by-title subscription in this scenario--after fulfilling requests via Open Access and PTA fulfillment wherever possible.
use_ill_percent
Percent of Usage that is fulfilled via ILL in this scenario. This is simply turnaways (Usage that can't be fulfilled by Open Access, PTA, or your title-by-title subscriptions) multiplied by your ILL Request Rate parameter. This article has more details: How do we calculate ILL requests and ILL cost?
use_other_delayed_percent
Percent of Usage that is fulfilled via Unknown paths in this scenario. This all Usage that isn't fulfilled by Open Access, PTA rights, title-by-title subscription or ILL. Instead, the user fulfills their information need some other way -- asks the author for a paper, asks a colleague, finds another similar paper that is good enough for their purposes, etc.
perpetual_access_years_text
The years when your institution has PTA (Post-Termination Access) to this title. Default: empty.
baseline_access_text
Optional text for those who have already left their Big Deals, to annotate which journals they'd previously subscribed to.
bronze_oa_embargo_months
For some journals the publisher automatically makes the content free to read after a certain number of months. For those journals, this column lists the age of the articles (in months) when they will automatically become free to read on the publisher's site.
Usage Components
downloads
The number of times people in your institution will download papers from this journal. We base this on your COUNTER statistics and literature on download decay curves (how often an article is to be downloaded based on how long ago it was published).
citations
The number of times that a paper authored by someone in your institution will cite a paper in this journal. This number is based on the citations patterns of the previous five years. The importance of this number to overall Usage is determined by the Citation Weight parameter.
authorships
The number of papers in this journal that have at least one author from your institution. This number is based on the authorship patterns of the previous five years. The importance of this number to overall Usage is determined by the Authorship Weight parameter.
Fuzzed
*_fuzzed
We include a few "fuzzed" columns, in case you want to share this data approximately but not exactly. For each column that's fuzzed, we ranked all the values in a column, then separated the values into three equal sized groups, and labelled them low, medium, and high. Low refers to the lowest value by rank, high refers to the highest value by rank, and medium in the middle. Where there is no value for the un-fuzzed value, we use a dash ("-").
If you want more fine grained resolution among titles you can refer to the un-fuzzed columns.
Last updated