Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update to 6.1 possibly caused loss of dataset thumbnails #10220

Closed
sergejzr opened this issue Jan 9, 2024 · 39 comments · Fixed by #10258
Closed

Update to 6.1 possibly caused loss of dataset thumbnails #10220

sergejzr opened this issue Jan 9, 2024 · 39 comments · Fixed by #10258
Assignees
Labels
Size: 30 A percentage of a sprint. 21 hours. (formerly size:33) Type: Bug a defect
Milestone

Comments

@sergejzr
Copy link

sergejzr commented Jan 9, 2024

Dear Dataverse Developers,

It seems that after updating 6.0 to 6.1there are several issues with thumbnails on our system. For example the manually created dataset thumbnails are gone and automatically created ones are low resolution. If I manually add large thumbnails, they are also large in the result list, destroying the web page. Here detailed cases with links to our test-system:

Case 1:
Disappeared icon: https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/9S7RJF
Icon added in 6.0, disappeared after update to 6.1. The icon however it still there if I open edit the thumbnail of the dataset. It does not appear even after editing the metadata and re-publishing

Case 2:
Auto-generated: https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/WOX4YV
The thumbnail was automatically generated from pdf of the dataset files in 6.0. After update to 6.1 it became tiny resolution. Generally all automatically generated icons have tiny resolution now....

Case 3:
Dataset with manually added large thumbnail in 6.1. https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/0OBMAR
The thumbnail appear ok in the dataset page, but in search results it is remaining huge. https://bonndata-test.hrz.uni-bonn.de/dataverse/sandbox

Since I am not sure whether there could be other source of error (I could not find any yet) I would like just to ask those who did update to check if they encounter the same..

PS: At the moment, I rolled back our productive installation back to 6.0 and try to find the cause on the test system.

Best regards
Sergej

@sergejzr sergejzr added the Type: Bug a defect label Jan 9, 2024
@sergejzr
Copy link
Author

sergejzr commented Jan 9, 2024

I would close it now, it might be not all icons were recreated and the recreation just needs time...

@pdurbin
Copy link
Member

pdurbin commented Jan 10, 2024

Hmm, this is probably due to this PR:

Should we add something to the release notes?

@sergejzr
Copy link
Author

sergejzr commented Jan 12, 2024

Hi Phil,
I finally managed to do some more tests. There are several issues with icons, i will update my first post and re-open if possible.

Case 1:
Disappeared icon: https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/9S7RJF
Icon added in 6.0, disappeared after update to 6.1. The icon however it still there if I open edit the thumbnail of the dataset. It does not appear even after editing the metadata and re-publishing

Case 2:
Auto-generated: https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/WOX4YV
The thumbnail was automatically generated from pdf of the dataset files in 6.0. After update to 6.1 it became tiny resolution.

Case 3:
Dataset with manually added large thumbnail in 6.1. https://bonndata-test.hrz.uni-bonn.de/dataset.xhtml?persistentId=doi:10.80624/FK2/0OBMAR
The thumbnail appear ok in the dataset page, but in search results it is remaining huge. https://bonndata-test.hrz.uni-bonn.de/dataverse/sandbox

As I can see, at the moment there are almost no servers running 6.1, except https://data.qdr.syr.edu/ But there are no manually added icons, I suppose.

Interestingly, if I manually add thumbnail to the demo system: https://demo.dataverse.org/dataset.xhtml?persistentId=doi:10.70122/FK2/38WUNS It does not appear at all , only if I edit the dataset (you can try, I guess, you have admin account there). https://demo.dataverse.org/dataset-widgets.xhtml?datasetId=2113302
Here the test file that should be used as thumbnail:
largetests

And it looks ok, if automatically generated:
https://demo.dataverse.org/dataset.xhtml?persistentId=doi:10.70122/FK2/CSWVJB

@landreev
Copy link
Contributor

The bug where a manually-added dataset thumbnail is showing in full-res in the search results, effectively breaking the page - is that reproducible on demo.dataverse.org? Or is that something that only happens with existing thumbnails, when upgrading to 6.1?

@sergejzr
Copy link
Author

sergejzr commented Jan 17, 2024

Thats true, I could not reproduce it at dataverse.org. At our system it applies to newly added thumbnails.
Also I noticed that in 6.0 the preview Thumbnails were bigger in the dataset view. In 6.1 both, dataset view and search list, the thumbnail size is 48x48 (which is too small for dataset view)

@landreev
Copy link
Contributor

Also I noticed that in 6.0 the preview Thumbnails were bigger in the dataset view. In 6.1 both, dataset view and search list, the thumbnail size is 48x48 (which is too small for dataset view)

I'm a bit confused by this part of your report - from looking both at the code, and at our own 6.0 instance here at IQSS, it really looks like the dataset-level thumbnails, the ones at the top of the dataset page, have always been 48 px.

Was your 6.0 instance a modified, forked version by any chance?

@landreev
Copy link
Contributor

I am very concerned about the other bug, the huge "thumbnail", if you can call it that, that messes up the search results page. Trying to reproduce it locally...

@landreev
Copy link
Contributor

Was your 6.0 instance a modified, forked version by any chance?

Another possibility of course would be that the thumbnails you had cached on disk or s3 just happened to be higher-res (generated outside of Dataverse, for example - ?), and Dataverse was just using them happily instead of the standard 48x ones. But then 6.1 had them re-scaled from scratch - which would be a possibility, since that part of the application was modified in 6.1.

@landreev
Copy link
Contributor

... it really looks like the dataset-level thumbnails, the ones at the top of the dataset page, have always been 48 px.

Uhm, that doesn't appear to be true, actually. It looks like we did make it larger, and variable/configurable at some point. @pdurbin we may have missed this when we were talking about it yesterday (?).

This is all turning out to be way more confusing than it should be...

@landreev
Copy link
Contributor

I can reproduce the "giant thumbnail" issue locally. ☹️

@landreev
Copy link
Contributor

I can reproduce the issue with either filesystem or S3 storage. (And that of course means that it would affect us here at IQSS, if we deployed 6.1 this week, as we were planning to do).
One remaining mystery is why this thing couldn't be easily reproduced on demo.dataverse.org - but let me take another look into that.
Ouch.

@landreev
Copy link
Contributor

landreev commented Jan 17, 2024

... Interestingly, if I manually add thumbnail to the demo system: https://demo.dataverse.org/dataset.xhtml?persistentId=doi:10.70122/FK2/38WUNS It does not appear at all

OK, it appears that custom dataset logos do not work unless there are other images in the dataset. Obviously a bug in itself; even if not the most serious of the bugs we are looking at...

But yes, I uploaded another file into the dataset above, published it - and that reproduced the bug just as described. I'm attaching the screenshot from demo, but I will disable that custom thumbnail in the dataset, since we don't want the main page to stay messed up there.

Thank you for the report - this appears to be fairly serious.

Screen Shot 2024-01-17 at 11 25 19 AM

@sergejzr
Copy link
Author

Sorry to answer bit late. Seems you found the answers to most questions already.

Regarding:

Also I noticed that in 6.0 the preview Thumbnails were bigger in the dataset view. In 6.1 both, dataset view and search list, the thumbnail size is 48x48 (which is too small for dataset view)

I'm a bit confused by this part of your report - from looking both at the code, and at our own 6.0 instance here at IQSS, it really looks like the dataset-level thumbnails, the ones at the top of the dataset page, have always been 48 px.

Was your 6.0 instance a modified, forked version by any chance?

No, we always take the official releases for our productive system and I never changed anything regarding the the thumbnail settings... Here is one example on our productive system: https://bonndata.uni-bonn.de/dataset.xhtml?persistentId=doi:10.60507/FK2/KZHXDF

I also looked randomly over other installations, they all seem to have large thumbnails:
6.0: https://dataverse.asu.edu/dataset.xhtml?persistentId=doi:10.48349/ASU/LTWBGY
https://archaeology.datastations.nl/dataset.xhtml?persistentId=doi:10.17026/AR/QGVAI2
5.14:
https://data.aussda.at/dataset.xhtml?persistentId=doi:10.11587/K5OMQN
https://edmond.mpg.de/dataset.xhtml?persistentId=doi:10.17617/3.OYI3T9
5.11:
https://dataverse.csuc.cat/dataset.xhtml?persistentId=doi:10.34810/data1028

@landreev
Copy link
Contributor

I also looked randomly over other installations, they all seem to have large thumbnails ...

Yes, I was clearly wrong about that, sorry for the confusion.

This is also a bug, for sure. One extra confusing part, when reproducing it, is that the page shows the correct size (140px) thumbnail while the dataset is in DRAFT:

Screen Shot 2024-01-17 at 12 47 44 PM

... then switches to the tiny 48px size when the dataset is published:
Screen Shot 2024-01-17 at 1 44 41 PM

(everybody: once again, this is a different bug; the above only happens when one of the image files in the dataset is chosen as the dataset thumbnail; when a custom "logo" image is uploaded, that is subject to the other bug, as described earlier in the issue)

@cmbz
Copy link

cmbz commented Jan 17, 2024

2024/01/17: Added to This Sprint, applied size: 33 during the sprint kickoff

@cmbz cmbz added the Size: 30 A percentage of a sprint. 21 hours. (formerly size:33) label Jan 17, 2024
@cmbz cmbz moved this from SPRINT READY to This Sprint 🏃‍♀️ 🏃 in IQSS Dataverse Project Jan 17, 2024
@landreev
Copy link
Contributor

Producing a proper patch for these bugs will require some work/involve other members of our dev. team.
But I can offer a quick-and-dirty workaround for the more destructive issue (the "huge thumbnail" bug):
The size of the thumbnails on the search results page can be brought back to normal by making the following change in the file .../domain1/applications/dataverse-6.1/resources/css/structure.css:
replace this line:
[id$='resultsTable'] div.card-preview-icon-block img {vertical-align:middle;}
with
[id$='resultsTable'] div.card-preview-icon-block img {vertical-align:middle; max-width: 64px; max-height: 48px;}

This of course will require a Payara restart. And you most likely will need to shift-reload the page, because browsers usually cache css.

@landreev
Copy link
Contributor

One extra bit of intel, per @qqmyers is that it may have been the optimization PR #9669 that caused the issue with a larger image, specifically the part mentioned in the description there as

Switches to returning a download URL (versus a base64-encoded copy) in the main dataset search display

With the commits in question likely being
0c89723
aa75382

The PR in question added much-needed optimizations to the thumbnail management. But a couple of things like the above just need to be adjusted - should be very doable.

@jp-tosca jp-tosca self-assigned this Jan 17, 2024
@jp-tosca jp-tosca moved this from This Sprint 🏃‍♀️ 🏃 to In Progress 💻 in IQSS Dataverse Project Jan 17, 2024
@qqmyers
Copy link
Member

qqmyers commented Jan 17, 2024

W.r.t. the small logo size - my guess would be that https://github.com/IQSS/dataverse/blob/f74e4c1c29a6b7381b0a9c416e7228130ed22307/src/main/java/edu/harvard/iq/dataverse/dataset/DatasetUtil.java#L467C57-L467C57 should use DEFAULT_DATASETLOGO_SIZE rather than card size. This method was added late in QA of #9669 and we must have missed testing the published case. (I haven't fully tested but at QDR we use DEFAULT_PREVIEW_SIZE and limit the display to 140px with css because I think we use the same method to return header metadata (unmerged PR #5621) where we want the larger size.)

@landreev
Copy link
Contributor

... can offer a quick-and-dirty workaround ...

@donsizemore from Odum has built a patched .war file with the css workaround included (thank you, Don!): https://github.com/donsizemore/dataverse_backports/releases/download/6.1_css-patch/dataverse-6.1.war

@landreev
Copy link
Contributor

The three bugs of the apocalypse. The “giant thumbnail bug”, the “tiny thumbnail bug” and the “no thumbnail bug".

@qqmyers
Copy link
Member

qqmyers commented Jan 18, 2024

FWIW: W.r.t. the giant thumbs - the code that was replaced in #9669 was dynamically down-scaling every time the page was viewed, and was embedding the image in the page as a base64-encoded string rather than sending a URL that could be cached by the browser. If the css fix above (and the fact that there is a file size limit to avoid truly huge logos being uploaded) isn't enough, I'd suggest perhaps that we do the down-scaling as part of upload and still reference the image via URL so it can be cached. (Hopefully referencing by URL is also better for the SPA than us creating a api call to get the base64-encoded image string.)

@landreev
Copy link
Contributor

landreev commented Jan 18, 2024

I just said on slack that I'd be ok with adopting the css fix as the actual solution; because of the size limit.
That said, I'm pretty sure that we do already rescale, and cache that logo image. Either on upload, or on the first call from the dataset page.
If you have a dataset into which you uploaded a custom logo, you should see something like this in its file directory:

ls -l files/10.70122/FK2/4Q5JBM/dataset_logo* 
-rw-r--r--  1 landreev  wheel   43721 Jan 17 15:39 files/10.70122/FK2/4Q5JBM/dataset_logo.thumb140
-rw-r--r--  1 landreev  wheel    5666 Jan 17 15:39 files/10.70122/FK2/4Q5JBM/dataset_logo.thumb48
-rw-r--r--  1 landreev  wheel  228542 Jan 17 15:39 files/10.70122/FK2/4Q5JBM/dataset_logo_original

So, I'm perfectly fine with keeping the URL solution in place. But it just looks like that api reads and returns the contents of the dataset_logo_original instead of the dataset_logo.thumb140. And that should be a trivial fix .
Does that make sense?

@sergejzr
Copy link
Author

The patch works great!

Is there a way to change the size of automatically generated thumbs as well? Maybe some settings?

@landreev
Copy link
Contributor

Somebody (namely, @jp-tosca) is working on a fix for that bug as well. There are no similarly quick fixes, from what I understand though - i.e. it should still be simple enough, but will require a code change.

@jp-tosca
Copy link
Contributor

jp-tosca commented Jan 22, 2024

I may have found the 4th horseman... @landreev It seems that when a file is uploaded thumbnail of the customized thumbnail is overwritten, I am still figuring out the behavior sadly having the dataset_logo.thumb48 won't let us have a fix for this until I figure out what is exactly happening to these images.

@jp-tosca
Copy link
Contributor

The difference from the "small/big" thumbnail is caused because when the DS is on draft still uses the base64 conversion but when it is on non-draft it uses the API version, this image block has a max-width and max-height of 140 px and because of this the image doesn't break the entire interface as it does on the search page.

@jp-tosca
Copy link
Contributor

Following with the solution for the main issue it may involve a change where it calls the /thumbnail, not the /logo endpoint but still have to dig a little bit more into it but we may be close to a solution at least for the first and most critical issue.

@landreev
Copy link
Contributor

I'm not sure I understand the above, tbh. The part about the 4th bug; or the last comment, about a different endpoint.

What about Jim's comment above, from 5 days ago ("W.r.t. the small logo size - my guess would be that ...")? - it appears to imply that it may just be a wrong size in one specific code line that needs to be changed - no?

@jp-tosca
Copy link
Contributor

It seems there is another endpoint to bring the thumbnail so I am not sure we need to change right now the behavior of the endpoint but to switch the JSF to use the other one, if you change the source of the image from api/datasets/{id}/logo to api/datasets/{id}/thumbnail it will show the size that we need for the search, look at the image for an example:

fix

The 4th bug if it exists I will documented at a later point, just wanted to bring awareness that could be more involved.

@landreev
Copy link
Contributor

... if you change the source of the image from api/datasets/{id}/logo to api/datasets/{id}/thumbnail it will show the size that we need for the search

I see, so this is still about the "giant thumbnail" bug, got it. That's good to know.

I was mainly asking about the "tiny thumbnail [as derived from an image Datafile]" bug on the dataset page. It just sounded like there was at least a possibility of a very simple fix for that one, if I was reading Jim's comment correctly.

@landreev
Copy link
Contributor

Yeah, if there's another bug, I'd like to see a reproducer/curious to know what it is. I'm not surprised that there could be more bugs. I've been saying this forever, that this whole system of thumbnails is way too complicated; probably unnecessarily so, compared to the actual value of thumbnails as a feature.
It would be great to maybe simplify it somehow when we transition to the SPA. But that's way outside of this issue.
No rush, on this 4th bug - unless it's something over-the-top destructive, like that first "giant thumbnail" bug.

@landreev
Copy link
Contributor

I am not sure we need to change right now the behavior of the [api/datasets/{id}/logo] endpoint

If simply switching to /thumbnail works - sure! It's just a bit hard to think of a practical case where it would be necessary, or useful for that /logo api to return a full-rez image (?). So, that was the reason I assumed we wanted to change its behavior.
But, I'm all in favor of fixing this with the absolute minimum of work invested! Let's keep in mind that it's all cosmetic/none of this is truly catastrophic.

And, as I mentioned earlier, I'm less worried about this particular bug (the "giant thumbnail" one), since we already have a css workaround for it.

@jp-tosca
Copy link
Contributor

I tested this change on DatasetUtil with both values and this didn't help. It seems the code from 441 is not being executed, I will get back when I find more.

@landreev @qqmyers

@jp-tosca
Copy link
Contributor

I am adding a PR in a few adding the CSS that @landreev proposed + some padding to the right, otherwise horizontal oriented thumbnails will look something like this:

image

I will also apply the fix that @qqmyers recommended, that will specifically address the issue that @landreev discovered with the small-medium thumbnails.

After that I will create an issue for the behavior when sometimes you don't get a thumbnail, I will check on 6.0 if this used to happen and add that to the post.

If you guys think we are missing something please let me know.

@landreev
Copy link
Contributor

some padding to the right, otherwise horizontal oriented thumbnails will look something like this:

Hmm. Wondering why max-width: 64px wasn't taking care of that - ? The thumbnail in the screenshot really looks like it's wider than 64 pix. Not the biggest issue, for sure.

@landreev
Copy link
Contributor

@sergejzr We now have a patched .war file with the fixes for this, and a couple of other issues (#10251 and #10200) available at https://github.com/donsizemore/dataverse_backports/releases/tag/6.1_20240126 (thank you, @donsizemore!). These bug fixes have already gone through our normal testing/QA process. We are still offering this build as a prototype at this point, so, if you choose to deploy it, please test it carefully and let us know asap if you encounter any problems.
We appreciate your patience, and your help with this effort.

@pdurbin pdurbin added this to the 6.2 milestone Jan 27, 2024
@lmaylein
Copy link
Contributor

Patched .war file fixes this for us, see https://heidata.uni-heidelberg.de/dataverse/arthistoricum

@landreev
Copy link
Contributor

Patched .war file fixes this for us, see https://heidata.uni-heidelberg.de/dataverse/arthistoricum

I went to look, and had a scare moment - because all the thumbnails on the page still looked huge!! - That of course was because I had been to your site in the last few days, and my browser was caching the old css. Shift-reload fixed it, so, all is good.

@sergejzr
Copy link
Author

@sergejzr We now have a patched .war file with the fixes for this, and a couple of other issues (#10251 and #10200) available at https://github.com/donsizemore/dataverse_backports/releases/tag/6.1_20240126 (thank you, @donsizemore!). These bug fixes have already gone through our normal testing/QA process. We are still offering this build as a prototype at this point, so, if you choose to deploy it, please test it carefully and let us know asap if you encounter any problems.
We appreciate your patience, and your help with this effort.

Looks very good, thank you!! Also the "old" thumbnails are back. I will let it on our test-server for some time and move to production later on.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Size: 30 A percentage of a sprint. 21 hours. (formerly size:33) Type: Bug a defect
Projects
None yet
Development

Successfully merging a pull request may close this issue.

7 participants