Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error with Creating Serialized Datasets #20

Open
Ping-Song opened this issue Jan 15, 2024 · 9 comments
Open

Error with Creating Serialized Datasets #20

Ping-Song opened this issue Jan 15, 2024 · 9 comments

Comments

@Ping-Song
Copy link

When I was creating serialized datasets with the following code, it always show the error 'no matches found'
Code: create_external_datasets.py --dataset (car|income|diabetes|heart|bank|blood|calhousing|creditg|jungle) (--list) (--list (--tabletotext|--t0serialization|--values|--permuted|--shuffled))
Error: zsh: no matches found: (car|income|diabetes|heart|bank|blood|calhousing|creditg|jungle)

I did run the previous code in Preparing the Environments section.

@Ping-Song
Copy link
Author

I also found out that cudatoolkit cannot be installed, do you know why?

@stefanhgm
Copy link
Contributor

Hello @Ping-Song ,

thanks for using our code and for reaching out with this issue.

Just to be sure, what is the exact command that you are running?

To get a serialization for the car dataset you should run:

create_external_datasets.py --dataset car

Does this work or what output does it create?

Thank you!

@Ping-Song
Copy link
Author

Hi, thank you for your reply. I tried it again, it says "zsh: command not found: create_external_datasets.py"

@Ping-Song
Copy link
Author

When I was running "conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge". There are PackageNotFoundError:

PackagesNotFoundError: The following packages are not available from current channels:

  • cudatoolkit=11.3
  • torchaudio==0.10.1

Could you please tell me how to install this 2 packages or what alternative package I can use?

@Ping-Song
Copy link
Author

Hello @Ping-Song ,

thanks for using our code and for reaching out with this issue.

Just to be sure, what is the exact command that you are running?

To get a serialization for the car dataset you should run:

create_external_datasets.py --dataset car

Does this work or what output does it create?

Thank you!

Hi, should I run "create_external_datasets.py --dataset car" or "create_external_datasets.py --dataset car (--list) (--list (--tabletotext|--t0serialization|--values|--permuted|--shuffled))" as shown in this github instruction?

@Ping-Song
Copy link
Author

Hi, I managed to run the serialization, but there is one additional error: AttributeError: module 'huggingface_hub.constants' has no attribute 'HF_HUB_CACHE'.

Can you help me with this please? Thanks!

@stefanhgm
Copy link
Contributor

Hello @Ping-Song ,

Hi, thank you for your reply. I tried it again, it says "zsh: command not found: create_external_datasets.py"

Please make sure that you are in the correct directory.

Hi, should I run "create_external_datasets.py --dataset car" or "create_external_datasets.py --dataset car (--list) (--list (--tabletotext|--t0serialization|--values|--permuted|--shuffled))" as shown in this github instruction?

The second command does not work. It just shows all possible options you can run the script with. You have to choose the options depending on what you need. The first command should work.

When I was running "conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge". There are PackageNotFoundError:

PackagesNotFoundError: The following packages are not available from current channels:

cudatoolkit=11.3
torchaudio==0.10.1
Could you please tell me how to install this 2 packages or what alternative package I can use?

Sorry for that. The package can often cause problems as version are not compatible anymore. Often you have to try around a bit and check online for alternatives. What worked for me the last time I setup the environment was the following:

conda create -n tabllm python==3.8
conda activate tabllm
pip install fsspec==2023.5.0
pip install click==7.1.2

The using the requirements file of t-few (https://github.com/r-three/t-few/blob/master/requirements.txt).

pip install --use-deprecated=legacy-resolver  -r requirements.txt -f https://download.pytorch.org/whl/cu113/torch_stable.html
pip install urllib3==1.26.6
pip install importlib-metadata==4.13.0
pip install scikit-learn
pip install setuptools==59.5.0
pip install protobuf==3.20.3

I used the resulting environment for both t-few and TabLLM code.

Hope that works!

@stefanhgm
Copy link
Contributor

Hello @Ping-Song,

Sorry, I do not know this error. Maybe setting the environment variable can help?

export HF_HUB_CACHE=~/.cache/huggingface

@Ping-Song
Copy link
Author

Hello @Ping-Song,

Sorry, I do not know this error. Maybe setting the environment variable can help?

export HF_HUB_CACHE=~/.cache/huggingface

Hi Thanks for your prompt reply! I did set the environment variable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants