Auto-settings rule based on "category" #994

Open
gildas-lormeau opened this issue Jul 6, 2022 Discussed in #986 · 1 comment

Comments

@gildas-lormeau
Owner

Discussed in #986

Originally posted by thecheesemo June 23, 2022
I have been using SingleFile extensively to manually save pages for later reference. I do this mainly for archiving purposes and because I have a bad memory. I've since realized that I always want to save every news article I read. So, I've started to define Auto-settings rules with a specific SingleFile profile for each URL of the news sites I visit. This profile auto-saves pages, and that works fine. However, like most people on the web now, I don't go to just one news source; I get most of my news from link aggregators like Google News and Reddit. As a result, I have to manually add each news website's top-level domain (TLD) to the Auto-settings rule with my news autosave profile.

Would it be possible to add an option in the Auto-settings that lets you assign a profile based on the "category" of the TLD?

I don't even know whether such a list exists that could be referenced, or how one would go about making one other than through a manual process. But if such a list does exist, or can be created easily enough, that would be a pretty great feature.

@lumnikemel

I did some searching, and it looks like all the APIs to look up categorized webpages are very limited (throttled) or require payment.

You could implement one yourself: https://github.com/domantasm96/URL-categorization-using-machine-learning.
There are a number of projects on GitHub for URL classification that might suit this purpose.

I figured that there would at least be a pre-classified list of the top 1000 sites, but I'm having a hard time finding that, too.
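
For illustration only, here is a rough Python sketch of how a category-based rule could work if a pre-classified list were available. Everything in it is an assumption: the `DOMAIN_CATEGORIES` entries, the `CATEGORY_PROFILES` mapping, the profile name `news-autosave`, and the function names are hypothetical and are not part of SingleFile.

```python
from urllib.parse import urlsplit

# Hypothetical, hand-maintained category list; the entries are illustrative only.
DOMAIN_CATEGORIES = {
    "bbc.com": "news",
    "reuters.com": "news",
    "github.com": "development",
}

# Hypothetical mapping from a category to a profile name.
CATEGORY_PROFILES = {"news": "news-autosave"}


def category_for(url):
    """Return the category of the URL's host, also trying its parent domains."""
    host = (urlsplit(url).hostname or "").lower()
    labels = host.split(".")
    # For "www.bbc.com", try "www.bbc.com", then "bbc.com", then "com".
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in DOMAIN_CATEGORIES:
            return DOMAIN_CATEGORIES[candidate]
    return None


def profile_for(url, default="default"):
    """Pick the profile for a URL, falling back to a default profile."""
    return CATEGORY_PROFILES.get(category_for(url), default)
```

With a lookup like this, a single rule per category would replace the per-site rules, and the hard part remains obtaining or maintaining the list itself.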
