Hello,
I'm seeking ways to increase the visibility of metadata stored in our Dataverse repository and I'm interested in integrating it with Google Scholar. Could someone provide guidance or tips on how to configure Dataverse to automatically share metadata with Google Scholar? I appreciate any help or suggestions you can offer!
Hi! A good place to start is https://guides.dataverse.org/en/6.1/installation/config.html#letting-search-engines-crawl-your-installation
In short, you definitely need to make sure robots.txt is not blocking crawlers.
And you should set up your sitemap and submit it to Google.
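A quick way to sanity-check the robots.txt part locally is sketched below, using Python's standard `urllib.robotparser`. The robots.txt content and the `dataverse.example.edu` host are made up for illustration, not copied from the Dataverse guides, and `crawler_allowed` is a hypothetical helper name. Python's parser applies the first matching rule, which may differ from how Google resolves overlapping rules, so treat this as a rough check only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for illustration only: let crawlers reach dataset
# and collection pages, block everything else. Not the official sample file.
ROBOTS_TXT = """\
User-agent: *
Allow: /dataset.xhtml
Allow: /dataverse/
Disallow: /
"""


def crawler_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    base = "https://dataverse.example.edu"  # assumed host name
    for path in ("/dataset.xhtml?persistentId=doi:10.5072/FK2/EXAMPLE",
                 "/dataverseuser.xhtml"):
        print(path, crawler_allowed(ROBOTS_TXT, "Googlebot", base + path))
```

To test a live installation instead, `RobotFileParser.set_url()` followed by `read()` fetches the real file over the network.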
This is to set up Google Dataset Search. I am not sure if Google Scholar does "data".
Oh! Google Scholar! Yes, I agree. I don't think it does data. Not sure.
I've already configured robots.txt and sitemap.xml, and Google is indexing our Dataverse repository. Now my goal is to ensure that the metadata within Dataverse is indexed and found when someone searches on Google Scholar. I'm seeking specific guidance on how to configure Dataverse to achieve this. I appreciate any additional help or suggestions.
Hmm. I'm not sure it's possible. Google Scholar is for articles, not data.
Hi @Marcos Anjos and @Philip Durbin,
e-cienciaDatos is in Google Scholar via DataCite. The problem is that the DataCite metadata doesn't include some important fields, such as rights, subject, or language.
I didn't create a new issue because there are already several in-progress issues about the metadata.
We have made some changes to the DataCite metadata that work for us, but they have not been tested thoroughly.
You can see the DataCite metadata related to languages, licenses, and subjects for Harvard Dataverse or other Dataverse-based repositories:
Dataverse: https://commons.datacite.org/repositories/x3oc4vr
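To spot the gaps mentioned above for a single record, a minimal sketch like the following could help. It assumes the record's attributes are already available as a Python dict, and that rights, subjects, and language appear under the keys `rightsList`, `subjects`, and `language`, as in the DataCite REST API JSON as far as I know. Please verify those key names against the DataCite documentation. The sample record is invented and `missing_fields` is a hypothetical helper.

```python
# Attribute names assumed from the DataCite REST API JSON representation;
# verify them against the DataCite documentation before relying on this.
FIELDS_TO_CHECK = ("rightsList", "subjects", "language")


def missing_fields(attributes: dict) -> list[str]:
    """Return the checked fields that are absent or empty in a record."""
    return [name for name in FIELDS_TO_CHECK if not attributes.get(name)]


# Invented sample record for illustration; not real repository metadata.
sample_attributes = {
    "doi": "10.5072/FK2/EXAMPLE",
    "titles": [{"title": "Example dataset"}],
    "subjects": [{"subject": "Social Sciences"}],
    "rightsList": [],
    "language": None,
}

if __name__ == "__main__":
    print(missing_fields(sample_attributes))
```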
Sorry, you are right, @Sherry Lake and @Philip Durbin. Our datasets are not in Google Scholar.
I confused it with Google Dataset Search.
The naming is confusing! :sweat_smile:
Philip Durbin said:
The naming is confusing! :sweat_smile:
Reminds me of struggling to figure out what GDCC meant; my brain kept thinking gcc. Could be good to have a non-acronymised name that can't be confused... :grinning_face_with_smiling_eyes:
I sometimes get the number of C's wrong. Too many or too few. :sweat_smile:
Last updated: Nov 01 2025 at 14:11 UTC