@Kelly Stathis just shared with me (in #community > DataCite Usage Tracker ) that you can easily get a list of the 25 most cited datasets in your installation of Dataverse through the DataCite API.
Here's an example for Harvard Dataverse:
curl 'https://api.datacite.org/dois/?client-id=gdcc.harvard-dv&sort=-citation-count' | jq '.data[].attributes | "\(.citationCount) \(.url)"' -r
144 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/OHHUKH
34 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/VOQCHQ
32 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/28075
26 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/LM4OWF
25 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/GDF6Z0
25 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/PRFF8V
18 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/4JQRCL
16 https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/ZTPW0Y
16 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/LEJUQZ
14 https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/PE8TWP
13 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/HTTWYL
11 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/28468
10 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/ZSBZ7K
9 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/DBW86T
9 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/LZHMG3
9 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/TDOAPG
8 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/42MVDX
8 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/WMGTNS
8 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/TTMZ08
8 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/3WZFK9
8 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/26147
7 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/NRR7MB
7 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/E9N6PH
7 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/AMRXJA
7 https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/25833
I just spread the word about this: https://groups.google.com/g/dataverse-community/c/Q2PbdEaZjaA/m/vWSiGBtaBQAJ
As I wrote in that post, I'm a bit confused about how to get the client-id. It would be nice if I could see it at https://commons.datacite.org/repositories/x3oc4vr for Harvard Dataverse, for example.
Hi @Philip Durbin thanks for sharing this!
To find the client-id from a Commons repository page, if you click "Find Related Works" on https://commons.datacite.org/repositories/x3oc4vr it leads to this Works search result page: https://commons.datacite.org/doi.org?query=client.uid:gdcc.harvard-dv where you can see the client-id in the URL as "client.uid". I've shared with the team that it could be helpful to expose this in the interface as well.
@Kelly Stathis thanks!
I spot-checked a few Dataverse installations from our list of 111 and "find related works" is absent in many cases:
Do you know why this is? I do see "find related works" for these:
It's a great trick when it works!
If it helps, in that list we also track the DOI authority. Maybe we can look up the client-id that way?
In our case (Cirad), "find related works" is absent on our repository page, but we can find it within our organization. My guess is that our client ID is linked to our organization rather than directly to our repository. I can retrieve citations from the dataset using your curl command example; it's just mixed up with other information related to our organization.
Hi @Philip Durbin, there are a few different things going on here:
The repository page for Odum Institute Archive Dataverse https://commons.datacite.org/repositories/3zncw48 is coming from this re3data record: http://doi.org/10.17616/R31NJCHT.
The DataCite repository account for Odum Institute Dataverse has client-id GDCC.ODUM-DV. In their Fabrica settings, the repository is linked to the re3data record https://doi.org/10.17616/R3RP41, for UNC Dataverse. This has a different page in Commons: https://commons.datacite.org/repositories/vl6ce78 on which you'll see the Find Related Works link.
For Texas Data Repository, the DataCite repository account (TDL.TDL) doesn't have a re3data record specified. In Commons this is the page: https://commons.datacite.org/repositories/9z2cveo which is not matched to anything in re3data. To resolve this, the repository can add the corresponding re3data identifier in the Fabrica settings.
The UCLA Social Science Data Archive is a different situation. This appears to be using one prefix (10.25346) underneath the CDL.UCLA repository account, which has a total of 17 prefixes for different systems. I know CDL was investigating splitting these out into separate DataCite repository accounts—once that is complete, they could associate the specific repository account (say "CDL.UCLA-DV" or whatever the new client-id is) with the re3data identifier. Until then, all the UCLA DOIs show up here: https://commons.datacite.org/repositories/zr3cr9, which corresponds to the entire CDL.UCLA DataCite repository account.
To summarize how this works in general:
Holy moly, this is getting complicated! Thanks for the writeup, @Kelly Stathis !
@Gwenaël Doux For Cirad, you can update your settings here: https://doi.datacite.org/repositories/inist.cirad/settings to add the "re3data Record" - search for "CIRAD Dataverse". This should merge your current page in Commons (https://commons.datacite.org/repositories/zr3c27) with the one derived from the re3data record (https://commons.datacite.org/repositories/or2c783). Let me know how this goes!
What about the DOI authority? We track it in our spreadsheet of Dataverse installations. If I tell you 10.18419 can you (or some API) tell me the client-id?
Yes! https://api.datacite.org/prefixes/10.18419
tib.ubs in that case! Perfect! Thanks!
Thanks again. I finally got around to explaining this at https://groups.google.com/g/dataverse-community/c/Q2PbdEaZjaA/m/ogomK2RSAAAJ
Last updated: Nov 01 2025 at 14:11 UTC