Stream: community

Topic: downloading all files from dataset with private URL


view this post on Zulip Péter Pallinger (Jan 29 2026 at 14:05):

If a reviewer gets a private URL to an unpublished dataset, they may want to download multiple files from it, but the only way to do that currently seems to be through the web interface. However, the zipped download does not work for a larger total file size.
API download does not work. Even though you _can_ get the file list for the unpublished dataset through the API, downloading the files is forbidden (it seems to me, that it requires a JSESSIONID cookie).
Is there a true and tried way to download large datasets using private/preview URLs?

view this post on Zulip Philip Durbin 🚀 (Jan 29 2026 at 16:15):

You should be able to use the preview URL token as an API token. Does that help?

view this post on Zulip Dieuwertje Bloemen (Jan 29 2026 at 16:15):

It's quite a coincidence, because this week we finally have a production ready Globus integration that can also do exactly this :sweat_smile: . My colleague Eryk is working on making his work available on GitHub. Hopefully tomorrow or next week we can share it. We worked on improving Globus so we can use it for restricted access data and with private urls. The only thing that isn't working is anonymized private urls (the ones where the author name etc. gets redacted). I'll drop the url to the GitHub once I have it.

view this post on Zulip Péter Pallinger (Jan 30 2026 at 11:09):

Philip Durbin 🚀 said:

You should be able to use the preview URL token as an API token. Does that help?

That is a good tip, thank you. I did not see it in the documentation... I already implemented a workaround by getting a JSESSIONID, but this would be cleaner.

view this post on Zulip Philip Durbin 🚀 (Jan 30 2026 at 11:16):

Please feel free to open an issue to document this!

view this post on Zulip Dieuwertje Bloemen (Jan 30 2026 at 15:01):

For the Globus work, Eryk just released the new version of the rdm-integration (https://github.com/libis/rdm-integration) which includes Globus working for private urls.

view this post on Zulip Dieuwertje Bloemen (Jan 30 2026 at 15:02):

(we've also submitted the work for a presentation at the DCM26)

view this post on Zulip Philip Durbin 🚀 (Jan 30 2026 at 15:03):

Fantastic!

view this post on Zulip Philip Durbin 🚀 (Jan 30 2026 at 15:03):

Do you want it in the 6.10 release notes? :thinking:

view this post on Zulip Dieuwertje Bloemen (Jan 30 2026 at 15:41):

Sure, that would be great.

view this post on Zulip Philip Durbin 🚀 (Jan 30 2026 at 15:45):

Can you please ask Eryk to make a release note snippet?

view this post on Zulip Philip Durbin 🚀 (Jan 30 2026 at 15:46):

Or you can, if you feel up to it. Here's how: https://guides.dataverse.org/en/6.9/developers/version-control.html#writing-a-release-note-snippet

view this post on Zulip Dieuwertje Bloemen (Jan 30 2026 at 16:25):

yeah, we'll have a look.

view this post on Zulip Philip Durbin 🚀 (Feb 03 2026 at 16:48):

@Péter Pallinger thanks for PR #12140! :heart: Lemme know if you need any help with it!

view this post on Zulip Péter Pallinger (Feb 04 2026 at 07:42):

No, thank you, I just did not manage to finish it yesterday...

view this post on Zulip Péter Pallinger (Feb 04 2026 at 15:38):

I finalized the PR after making sure make html runs. Some references had to be fixed. Also merged the current develop branch.

view this post on Zulip Philip Durbin 🚀 (Feb 04 2026 at 15:41):

Yes, now the preview works: https://dataverse-guide--12140.org.readthedocs.build/en/12140/api/auth.html#using-preview-url-tokens-as-api-token

view this post on Zulip Philip Durbin 🚀 (Feb 04 2026 at 15:46):

@Péter Pallinger I just left you a review.

view this post on Zulip Péter Pallinger (Feb 05 2026 at 08:36):

I moved the section, you can review it again!

view this post on Zulip Philip Durbin 🚀 (Feb 17 2026 at 15:55):

Merged! Thanks again!


Last updated: Apr 03 2026 at 06:08 UTC