Sort entries by primary reading #1497

khaitruong922 · 2024-10-17T10:50:08Z

Add primary_reading param.
When click on gloss link, set furigana as primary_reading if exists.
Entries with reading matches primary_reading will be sorted to top.

Works correctly for Jitendex gloss link format. Don't know if there are any kind of dictionary format which will break this feature, since the reading is depended on the classes naming.

codspeed-hq · 2024-10-17T14:15:10Z

CodSpeed Performance Report

Merging #1497 will not alter performance

_{Comparing khaitruong922:search-reading (521cb1e) with master (9c5f824)}

Summary

✅ 2 untouched benchmarks

🆕 1 new benchmarks
⁉️ 1 (👁 1) dropped benchmarks

Benchmarks breakdown

	Benchmark	`master`	`khaitruong922:search-reading`	Change
👁	`Translator.prototype.findTerms - (n=43)`	154.8 ms	N/A	N/A
🆕	`Translator.prototype.findTerms - (n=45)`	N/A	157.8 ms	N/A

…itan into search-reading

khaitruong922 · 2024-10-17T14:30:27Z

Ready for review

Extract furigana from gloss link and add primary_reading search param

Sort entries which match primary_reading to top

stephenmk · 2024-10-18T16:10:41Z

Thank you for working on this. It looks good.

Works correctly for Jitendex gloss link format. Don't know if there are any kind of dictionary format which will break this feature, since the reading is depended on the classes naming.

I don't like this approach. I think it would be better to rely on the dictionary data to specify the ?primary_reading= parameter. I don't think it's a good idea to try to parse the primary reading from the furigana. This increases the code complexity and could lead to unexpected behavior. It's possible that someone may want to manually specify a ?primary_reading= parameter that is not equivalent to the furigana readings within the link.

In other words, I would remove this block of code:

yomitan/ext/js/display/structured-content-generator.js

Lines 459 to 481 in 051a2f0

    
           if (internal) { 
        
               let query = ''; 
        
               if (href.length > 1) { 
        
                   let hasFurigana = false; 
        
                   let reading = ''; 
        
                   for (const childNode of text.childNodes) { 
        
                       if (childNode instanceof HTMLElement) { 
        
                           const furigana = childNode.querySelector('.gloss-sc-rt')?.textContent; 
        
                           if (furigana && furigana.length > 0) { 
        
                               reading += furigana; 
        
                               hasFurigana = true; 
        
                           } 
        
                       } else { 
        
                           reading += childNode.textContent ?? ''; 
        
                       } 
        
                   } 
        
                   query = href; 
        
                   if (reading.length > 0 && hasFurigana) { 
        
                       query += `&primary_reading=${reading}`; 
        
                   } 
        
               } 
        
               href = `${location.protocol}//${location.host}/search.html${query}`; 
        
           }

Kuuuube · 2024-10-18T16:26:16Z

I think it would be better to rely on the dictionary data to specify the ?primary_reading= parameter.

Agreed.

khaitruong922 · 2024-10-18T16:30:50Z

The only place that user could specify the primary_reading is the search page, by modifying the url. I don't think this is the main use case, and no user will be aware of it.

The main purpose of this PR is to extract the reading from furigana, then sort the terms with that reading on top. For example, suppose that my dictionary has 縁 (えん) above 縁 (ふち). When I click on a word link 縁 (ふち), the entry 縁 (ふち) should be above 縁 (えん).

As far as I know, only Jitendex has furigana for word link. I agree that the implementation is quite fragile, we can find better ways to handle this.

khaitruong922 · 2024-10-18T16:40:44Z

I also agree with having the primary_reading specified inside the href field of Jitendex, then we can remove the extract furigana code.

stephenmk · 2024-10-18T18:47:47Z

I just published a new version of Jitendex that includes the primary_reading parameter in hyperlinks.

khaitruong922 · 2024-10-19T16:33:56Z

Tested with new version of Jitendex. Works great!

My.Video.mp4

MarvNC

so much work needed to just pass variables around

test/utilities/translator.js

Search term by exact reading when clicking on gloss link

c8a68f8

khaitruong922 changed the title ~~Search term by exact reading when clicking on gloss link~~ Search term by exact reading when clicking on gloss link with furigana Oct 17, 2024

khaitruong922 and others added 7 commits October 17, 2024 17:53

lint

bdbc635

update typing

1af5cde

type

e5a9374

sort by reading match

9689bbe

fix lint and test

3332465

add test

8038444

Merge branch 'master' into search-reading

1fa806c

khaitruong922 marked this pull request as ready for review October 17, 2024 14:13

khaitruong922 requested review from a team as code owners October 17, 2024 14:13

khaitruong922 changed the title ~~Search term by exact reading when clicking on gloss link with furigana~~ Sort term by exact reading when clicking on gloss link with furigana Oct 17, 2024

khaitruong922 added 2 commits October 17, 2024 21:21

fix typing

2d9609b

Merge branch 'search-reading' of https://github.com/khaitruong922/yom…

1dd2ca0

…itan into search-reading

khaitruong922 added 4 commits October 17, 2024 21:32

rename

c1bfc31

refactor

e8fab22

refactor

f3af6a3

rename to primary reading

051a2f0

khaitruong922 changed the title ~~Sort term by exact reading when clicking on gloss link with furigana~~ Sort term by primary reading when clicking on gloss link with furigana Oct 17, 2024

khaitruong922 changed the title ~~Sort term by primary reading when clicking on gloss link with furigana~~ Sort entries by primary reading when clicking on gloss link with furigana Oct 17, 2024

remove extract reading implementation

521cb1e

khaitruong922 changed the title ~~Sort entries by primary reading when clicking on gloss link with furigana~~ Sort entries by primary reading Oct 18, 2024

stephenmk mentioned this pull request Oct 19, 2024

Add primary_reading option for "dictionary deinflection" definition type #1507

Open

MarvNC approved these changes Oct 26, 2024

View reviewed changes

test/utilities/translator.js Show resolved Hide resolved

MarvNC added this pull request to the merge queue Oct 29, 2024

MarvNC added the kind/enhancement The issue or PR is a new feature or request label Oct 29, 2024

Merged via the queue into yomidevs:master with commit 0fd7009 Oct 29, 2024
11 checks passed

khaitruong922 deleted the search-reading branch November 5, 2024 07:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sort entries by primary reading #1497

Sort entries by primary reading #1497

khaitruong922 commented Oct 17, 2024 •

edited

Loading

codspeed-hq bot commented Oct 17, 2024 •

edited

Loading

khaitruong922 commented Oct 17, 2024 •

edited

Loading

stephenmk commented Oct 18, 2024

Kuuuube commented Oct 18, 2024

khaitruong922 commented Oct 18, 2024

khaitruong922 commented Oct 18, 2024

stephenmk commented Oct 18, 2024

khaitruong922 commented Oct 19, 2024

MarvNC left a comment •

edited

Loading

Sort entries by primary reading #1497

Sort entries by primary reading #1497

Conversation

khaitruong922 commented Oct 17, 2024 • edited Loading

codspeed-hq bot commented Oct 17, 2024 • edited Loading

CodSpeed Performance Report

Merging #1497 will not alter performance

Summary

Benchmarks breakdown

khaitruong922 commented Oct 17, 2024 • edited Loading

stephenmk commented Oct 18, 2024

Kuuuube commented Oct 18, 2024

khaitruong922 commented Oct 18, 2024

khaitruong922 commented Oct 18, 2024

stephenmk commented Oct 18, 2024

khaitruong922 commented Oct 19, 2024

MarvNC left a comment • edited Loading

Choose a reason for hiding this comment

khaitruong922 commented Oct 17, 2024 •

edited

Loading

codspeed-hq bot commented Oct 17, 2024 •

edited

Loading

khaitruong922 commented Oct 17, 2024 •

edited

Loading

MarvNC left a comment •

edited

Loading