Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Updated few code files which had few typos and errors in the code. #134

Open
wants to merge 4 commits into
base: main
Choose a base branch
from

Commits on Oct 4, 2024

  1. Update web_crawler.py

    vignesh1507 authored Oct 4, 2024
    Configuration menu
    Copy the full SHA
    1677967 View commit details
    Browse the repository at this point in the history
  2. Update utils.py

    vignesh1507 authored Oct 4, 2024
    Configuration menu
    Copy the full SHA
    7a41308 View commit details
    Browse the repository at this point in the history
  3. Update train.py

    error:
    
    Incorrect pipe name (textcat_multilabel): By default, SpaCy's TextCategorizer pipeline is called textcat, not textcat_multilabel. You need to ensure that you have installed and are using a specific multi-label classification pipeline or that you're setting it up correctly. If you're aiming to use multi-label classification, consider handling it via textcat and properly setting multi-label categorization in your training data.
    
    
    changes that i have made to the code:
    
    textcat: Replaced textcat_multilabel with textcat.
    
    Fixed random seed: Removed spacy.util.fix_random_seed and just used random.seed.
    
    Vocabulary filtering: Added isalpha() check for vocabulary words to ensure only alphabetic words are saved.
    
    Ensure you have the spacy-textcat component installed for multi-label text categorization.
    vignesh1507 authored Oct 4, 2024
    Configuration menu
    Copy the full SHA
    72e605c View commit details
    Browse the repository at this point in the history
  4. Update prompts.py

    vignesh1507 authored Oct 4, 2024
    Configuration menu
    Copy the full SHA
    f6ecb6f View commit details
    Browse the repository at this point in the history