Add Pixiv metadata plugin #950
* add metadata by pixiv illustration id
* add metadata from archive title
* add en tags, illust id capture
* support translated tags
* add default user agent
* clean up
* streamline illust metadata logic
* add manga specific metadata
* readme
fix bug where series metadata exists but is null
sanitize tags correct url embed
strong pattern matching
* add pixiv tests
* add pixiv to module tests
* pass github actions (#7) fix tests so they pass
* add refactor and tests
* fix pixiv tests
Some of the fields (e.g. user_id, series_id) are exclusively Pixiv properties, but other sources might have their own user_id field, which could result in metadata conflicts. I'm thinking there are a few ways to resolve this; which one is better?
A pixiv-specific namespace sounds like the way to go to me.
👋 Apologies for the long time reviewing this; Stuff's been busy! On top of it, well, I don't have much to say at all... this looks perfect. 😤
Congratulations @psilabs-dev, you just earned a holobyte! Here it is: https://holopin.io/holobyte/cluagx9w2448240fjqqanb8t72 This badge can only be claimed by you, so make sure that your GitHub account is linked to your Holopin account. You can manage those preferences here: https://holopin.io/account.
Thanks👌 Development was easy thanks to seeing/copying the previous plugins haha, I just cleaned it up a bit. Love the project!
Adds Pixiv cookie login and metadata extraction for illustrations from pixiv.net.
usage
Supported one-shot parameter formats (the plugin extracts $illust_id from either one for metadata extraction):
$illust_id
pixiv.net/en/artworks/$illust_id
Supported archive file name formats:
{$illust_id} title_of_work
pixiv_{$illust_id} title_of_work
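As an illustration of how the formats above could be matched, here is a minimal Python sketch. It is not the plugin's own code: the function names and regular expressions are hypothetical, and it assumes that the braces in the archive name formats are literal characters and that an optional `https://` or `www.` prefix is acceptable on the URL form.

```python
import re
from typing import Optional

# Hypothetical patterns derived from the formats listed above.
# Assumption: a one-shot parameter is either a bare numeric ID or an
# artworks URL, optionally prefixed with a scheme and "www.".
ONESHOT_PATTERNS = [
    re.compile(r"^(\d+)$"),
    re.compile(r"^(?:https?://)?(?:www\.)?pixiv\.net/en/artworks/(\d+)/?$"),
]

# Assumption: the braces around the ID in archive names are literal.
TITLE_PATTERNS = [
    re.compile(r"^\{(\d+)\}"),
    re.compile(r"^pixiv_\{(\d+)\}"),
]


def _first_match(patterns, text: str) -> Optional[str]:
    """Return the captured ID from the first pattern that matches, else None."""
    for pattern in patterns:
        match = pattern.match(text)
        if match:
            return match.group(1)
    return None


def illust_id_from_oneshot(param: str) -> Optional[str]:
    """Extract the illustration ID from a one-shot parameter."""
    return _first_match(ONESHOT_PATTERNS, param.strip())


def illust_id_from_title(title: str) -> Optional[str]:
    """Extract the illustration ID from an archive title."""
    return _first_match(TITLE_PATTERNS, title.strip())
```

Anchoring the patterns at the start of the string keeps an unrelated number later in a title from being mistaken for an illustration ID.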
parameters
tag_languages
: comma-separated list of languages to extract tags in (e.g. "en", "jp"). By default, an empty string corresponds to extraction of untranslated tags, i.e. "jp" tags.
metadata
A generic Pixiv illustration includes the following extractable metadata:
tags: list[str]
: list of comma-delimited strings for tags.
source: str
: URL link to Pixiv artwork
pixiv_user_id: int
: artist ID on pixiv (can be used to find user channel: https://pixiv.net/en/users/$user_id)
artist: str
: name of artist/username
date_created: int
: epoch time of artwork creation in seconds
date_uploaded: int
: epoch time of upload in seconds
If the illustration is a manga, this plugin may include additional manga metadata:
pixiv_series_id: int
: ID of the series which this manga belongs to.
pixiv_series_title: str
: name of the manga series
pixiv_series_order: int
: position of this illustration within the manga series
Taggable strings (e.g. tags, artist, series title) are sanitized of special characters by the following logic:
special characters (["?*%$:]) are removed (e.g. "lorem: ipsum" -> "lorem ipsum")
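The character-removal rule can be sketched in a few lines of Python. This is only an illustration of the rule as stated, not the plugin's implementation; the name `sanitize_tag` is hypothetical, and any other sanitization steps the plugin may perform are not shown.

```python
import re

# Character class taken from the description: " ? * % $ :
# Inside a character class these are all matched literally.
FORBIDDEN_CHARS = re.compile(r'["?*%$:]')


def sanitize_tag(text: str) -> str:
    """Remove every forbidden character; all other characters are kept as-is."""
    return FORBIDDEN_CHARS.sub("", text)


print(sanitize_tag("lorem: ipsum"))  # lorem ipsum
```

Because the characters are deleted and not replaced, surrounding whitespace is preserved, which is why "lorem: ipsum" keeps a single space.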