VaspInput setter and Incar.check_params() are inconsistent #4119

yantar92 · 2024-10-17T12:46:59Z

Python version

Python 3.12.4

Pymatgen version

2024.7.18

Operating system version

No response

Current behavior

When I set INCAR parametrs via vasp_input['INCAR']['PARAM']=value, they do not always pass Incar.check_params() checks.

Using the example below, I am getting:

...: BadIncarWarning: ENCUT: 550 is not a float
  vasp_input.incar.check_params()
...: BadIncarWarning: GGA: Cannot find Ps in the list of values
  vasp_input.incar.check_params()

Expected Behavior

Setting the parameters passes the checks.

Minimal example

from pymatgen.io.vasp.inputs import VaspInput
vasp_input = VaspInput.from_directory('.')
vasp_input['INCAR']['GGA'] = "MK"
vasp_input['INCAR']['GGA']

Relevant files to reproduce this bug

I used the following INCAR

ALGO = Normal
ENCUT = 550
ISMEAR = 0
NCORE = 16
SIGMA = 0.04
SYSTEM = C.hexagonal.191

Other requried inputs may be anything.

The text was updated successfully, but these errors were encountered:

DanielYang59 · 2024-10-17T14:29:55Z

Thanks for reporting this, I believe the warning on GGA=PS is owing to:

pymatgen/src/pymatgen/io/vasp/inputs.py

Line 988 in 4f7aa35

return val.strip().capitalize()

I.e. any keyword not included inside the following and is not boolean/float/int (str) would be capitalized:

pymatgen/src/pymatgen/io/vasp/inputs.py

Lines 876 to 932 in 4f7aa35

    
           list_keys = ( 
        
               "LDAUU", 
        
               "LDAUL", 
        
               "LDAUJ", 
        
               "MAGMOM", 
        
               "DIPOL", 
        
               "LANGEVIN_GAMMA", 
        
               "QUAD_EFG", 
        
               "EINT", 
        
           ) 
        
           bool_keys = ( 
        
               "LDAU", 
        
               "LWAVE", 
        
               "LSCALU", 
        
               "LCHARG", 
        
               "LPLANE", 
        
               "LUSE_VDW", 
        
               "LHFCALC", 
        
               "ADDGRID", 
        
               "LSORBIT", 
        
               "LNONCOLLINEAR", 
        
           ) 
        
           float_keys = ( 
        
               "EDIFF", 
        
               "SIGMA", 
        
               "TIME", 
        
               "ENCUTFOCK", 
        
               "HFSCREEN", 
        
               "POTIM", 
        
               "EDIFFG", 
        
               "AGGAC", 
        
               "PARAM1", 
        
               "PARAM2", 
        
           ) 
        
           int_keys = ( 
        
               "NSW", 
        
               "NBANDS", 
        
               "NELMIN", 
        
               "ISIF", 
        
               "IBRION", 
        
               "ISPIN", 
        
               "ISTART", 
        
               "ICHARG", 
        
               "NELM", 
        
               "ISMEAR", 
        
               "NPAR", 
        
               "LDAUPRINT", 
        
               "LMAXMIX", 
        
               "ENCUT", 
        
               "NSIM", 
        
               "NKRED", 
        
               "NUPDOWN", 
        
               "ISPIND", 
        
               "LDAUTYPE", 
        
               "IVDW", 
        
           ) 
        
           lower_str_keys = ("ML_MODE",)

The warning on ENCUT=550 should be owing to the incorrect classification of it into int_keys instead of float_keys:

pymatgen/src/pymatgen/io/vasp/inputs.py

Lines 910 to 924 in 4f7aa35

    
           int_keys = ( 
        
               "NSW", 
        
               "NBANDS", 
        
               "NELMIN", 
        
               "ISIF", 
        
               "IBRION", 
        
               "ISPIN", 
        
               "ISTART", 
        
               "ICHARG", 
        
               "NELM", 
        
               "ISMEAR", 
        
               "NPAR", 
        
               "LDAUPRINT", 
        
               "LMAXMIX", 
        
               "ENCUT",

I believe it should be a float:

ENCUT = [real]

yantar92 · 2024-10-23T11:48:16Z

With Version: 2024.10.22, I am still getting

/net/home/plgrid/plgyantar92/groupdir/functional-input.py:70: BadIncarWarning: Cannot find ZAB_VDW in the list of INCAR tags
  vasp_input.incar.check_params()
/net/home/plgrid/plgyantar92/groupdir/functional-input.py:70: BadIncarWarning: GGA: Cannot find Ml in the list of values
  vasp_input.incar.check_params()

DanielYang59 · 2024-10-23T13:44:01Z

Hi thanks for following up. I just have a look and I believe it's a separate issue related to #4042, i.e. we need to update the incar_parameters.json record.

ZAB_VDW is currently not in the recorded INCAR tags, and GGA doesn't have ML in the recording yet:

pymatgen/src/pymatgen/io/vasp/incar_parameters.json

Lines 191 to 218 in 3ee17e2

    
           "GGA": { 
        
             "type": "str", 
        
             "values": [ 
        
                "91", 
        
                "PE", 
        
                "AM", 
        
                "HL", 
        
                "CA", 
        
                "MK", 
        
                "RE", 
        
                "VW", 
        
                "B3", 
        
                "PZ", 
        
                "WI", 
        
                "RP", 
        
                "B5", 
        
                "BF", 
        
                "CO", 
        
                "PS", 
        
                "OR", 
        
                "BO", 
        
                "03", 
        
                "05", 
        
                "10", 
        
                "20", 
        
                "RA", 
        
                "PL" 
        
               ]

But currently I don't have a better way other than doing that manually (which would be very inefficiently) given the amount of possible INCAR tags and their values. Updating the tags should be easier with:

pymatgen/src/pymatgen/io/vasp/help.py

Lines 68 to 83 in 3ee17e2

    
           @classmethod 
        
           def get_incar_tags(cls) -> list[str]: 
        
               """Get a list of all INCAR tags from the VASP wiki.""" 
        
               tags = [] 
        
               for url in ( 
        
                   "https://www.vasp.at/wiki/index.php/Category:INCAR_tag", 
        
                   "https://www.vasp.at/wiki/index.php?title=Category:INCAR_tag&pagefrom=LREAL#mw-pages", 
        
                   "https://www.vasp.at/wiki/index.php?title=Category:INCAR_tag&pagefrom=Profiling#mw-pages", 
        
               ): 
        
                   response = requests.get(url, timeout=60) 
        
                   soup = BeautifulSoup(response.text, features="html.parser") 
        
                   for div in soup.findAll("div", {"class": "mw-category-group"}): 
        
                       children = div.findChildren("li") 
        
                       for child in children: 
        
                           tags.append(child.text.strip()) 
        
               return tags

But updating the values is a bit tricky so I assume we need to helper script to do this? Would you be interested? If not I'm happy to get my hands on this a bit later :)

yantar92 · 2024-10-26T15:16:34Z

But updating the values is a bit tricky so I assume we need to helper script to do this? Would you be interested? If not I'm happy to get my hands on this a bit later :)

    @classmethod
    def get_incar_tags(cls) -> list[str]:
        """Get a list of all INCAR tags from the VASP wiki."""
        url = ("https://www.vasp.at/wiki/api.php?"
               "action=query&list=categorymembers"
               "&cmtitle=Category:INCAR_tag"
               "&cmlimit=500&format=json")
        response = requests.get(url, timeout=60)
        response_dict = json.loads(response.text)

        def extract_titles(data):
            """Extract keywords from from Wikimedia response data.
            See https://www.vasp.at/wiki/api.php?action=help&modules=query
            Returns: List of keywords as strings.
            """
            return [category_data['title'] for category_data
                    in data['query']['categorymembers']]

        tags = extract_titles(response_dict)
        while 'continue' in response_dict:
            response = requests.get(
                url + f"&cmcontinue={response_dict['continue']['cmcontinue']}",
                timeout=60
            )
            response_dict = json.loads(response.text)
            tags = tags + extract_titles(response_dict)

        return tags

I can make it into pull request if you wish.

DanielYang59 · 2024-10-27T04:15:22Z

I can make it into pull request if you wish.

That would be hugely appreciated! You might have misread my previous comment #4119 (comment)

Current we already have a method to get the (tags/keywords/parameters):

pymatgen/src/pymatgen/io/vasp/help.py

Lines 68 to 70 in 0e65d35

    
           @classmethod 
        
           def get_incar_tags(cls) -> list[str]: 
        
               """Get a list of all INCAR tags from the VASP wiki."""

What we need is something to update the values, which is a bit tricky. Currently VASP wiki doesn't seem to provide an API to get possible values of a tag directly (do they?), and the webpage format seems pretty inconsistent and probably make scraping infeasible (I'm not a web scraping expert and I wish I'm wrong here).

For example the ALGO tag page seems to list all possible values at the start:

However the GGA tag only listed a small portion and left the rest inside a table:

* src/pymatgen/io/vasp/help.py (VaspDoc.get_incar_tags): Use Mediawiki API instead of parsing the HTML source directly. The old approach is not stable against changes in the tag list because of the way URLs are constructed. pagefrom= parameters start from certain tag, which is not guaranteed to provide the complete tag list as the new tags are added before that tag given in pagefrom=. At the moment of writing this commit, PRECFOCK tag is already missed using the old approach. Following up: materialsproject#4119 (comment)

yantar92 · 2024-10-27T09:47:03Z

What we need is something to update the values, which is a bit tricky. Currently VASP wiki doesn't seem to provide an API to get possible values of a tag directly (do they?),

I do not see anything either.

the webpage format seems pretty inconsistent and probably make scraping infeasible (I'm not a web scraping expert and I wish I'm wrong here).

Scaping is possible, but awkward. One will need to write the scaper specifically for this page and add a number of asserts to catch incompatible changes (those are unlikely in practice though).

I looked into the source of the page at https://www.vasp.at/wiki/index.php?title=GGA&action=edit
and it looks like they define the tag list simply within a table, not using a template (template would make scaping much easier).

It should be still possible to write a dedicated scraper, via pymediawiki + wikitexparser, but non-standard pages like the one for GGA will need to have a specially tailored handler.

I think that the most productive course of action here would be asking VASP maintainers to list all the possible keys in their TAGDEF template (the template defining GGA = PE | ... line) for this page and maybe for other pages with the same problem.

DanielYang59 · 2024-10-27T11:17:40Z

I think that the most productive course of action here would be asking VASP maintainers to list all the possible keys in their TAGDEF template

It sounds like the best option to me, not sure what is the best approach to reach out though, perhaps through the VASP forum?

yantar92 · 2024-10-27T11:49:26Z

Yup, the forum. I see no better options on https://www.vasp.at/wiki/index.php/The_VASP_Manual#Support

* src/pymatgen/io/vasp/help.py (VaspDoc.get_incar_tags): Use Mediawiki API instead of parsing the HTML source directly. The old approach is not stable against changes in the tag list because of the way URLs are constructed. pagefrom= parameters start from certain tag, which is not guaranteed to provide the complete tag list as the new tags are added before that tag given in pagefrom=. At the moment of writing this commit, PRECFOCK tag is already missed using the old approach. Following up: #4119 (comment)

yantar92 added the bug label Oct 17, 2024

DanielYang59 mentioned this issue Oct 18, 2024

Make Incar keys case insensitive, fix init Incar from dict val processing for str/float/int #4122

Merged

6 tasks

shyuep closed this as completed in #4122 Oct 21, 2024

DanielYang59 mentioned this issue Oct 26, 2024

Read INCAR SYSTEM as is, check_params use proc_val #4136

Merged

yantar92 mentioned this issue Oct 27, 2024

VaspDoc.get_incar_tags: Use Mediawiki API #4141

Merged

1 task

github-actions bot mentioned this issue Nov 1, 2024

Monthly issue metrics report #4149

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

VaspInput setter and Incar.check_params() are inconsistent #4119

VaspInput setter and Incar.check_params() are inconsistent #4119

yantar92 commented Oct 17, 2024 •

edited

Loading

DanielYang59 commented Oct 17, 2024 •

edited

Loading

yantar92 commented Oct 23, 2024

DanielYang59 commented Oct 23, 2024

yantar92 commented Oct 26, 2024 •

edited

Loading

DanielYang59 commented Oct 27, 2024 •

edited

Loading

yantar92 commented Oct 27, 2024

DanielYang59 commented Oct 27, 2024

yantar92 commented Oct 27, 2024 •

edited

Loading

VaspInput setter and Incar.check_params() are inconsistent #4119

VaspInput setter and Incar.check_params() are inconsistent #4119

Comments

yantar92 commented Oct 17, 2024 • edited Loading

Python version

Pymatgen version

Operating system version

Current behavior

Expected Behavior

Minimal example

Relevant files to reproduce this bug

DanielYang59 commented Oct 17, 2024 • edited Loading

yantar92 commented Oct 23, 2024

DanielYang59 commented Oct 23, 2024

yantar92 commented Oct 26, 2024 • edited Loading

DanielYang59 commented Oct 27, 2024 • edited Loading

yantar92 commented Oct 27, 2024

DanielYang59 commented Oct 27, 2024

yantar92 commented Oct 27, 2024 • edited Loading

yantar92 commented Oct 17, 2024 •

edited

Loading

DanielYang59 commented Oct 17, 2024 •

edited

Loading

yantar92 commented Oct 26, 2024 •

edited

Loading

DanielYang59 commented Oct 27, 2024 •

edited

Loading

yantar92 commented Oct 27, 2024 •

edited

Loading