Br v4.0.0 #103

annakrystalli · 2024-10-09T10:12:12Z

This PR:

BREAKING CHANGE: introduces is_required boolean property at the output_type level to configure whether the output type is required for submissions to be considered valid (introduce is_required field for all output_types #99).
BREAKING CHANGE: Disallows optional property in output_type_id objects. As such, when a given output type is submitted, values for all output type IDs much be submitted (disallow optional values in pmf output_type specifications #100,disallow optional values in cdf output_type specifications #101, disallow optional values in quantile output_type specifications #102).
Introduces optional derived_task_ids property to enable hub administrators to define derived task IDs (i.e. task IDs whose values depend on the values of other task IDs) at a hub level. This allows primarily validation functionality to ignore such task IDs when appropriate which can significantly improve validation efficency (Create property to record derived task IDs #96 ). For more information see hubValidations documentation on ignoring derived task IDs.
Add descriptions to repository object (add specifics about repository object #98)
Also clarified branch naming instruction to align with expectation in hubDocs

We were having trouble commenting the diff in #103 and have no clue why it's not working. Maybe this will fix it?

annakrystalli · 2024-10-09T14:28:33Z

Note I did manage to re-indent with jq but it expanded all arrays to multiple rows too with no clear way how to avoid that. I've tried to manually focus the diff on what's important in v4.0.0 instead.

If someone has better ideas let me know!

zkamvar · 2024-10-09T16:33:37Z

/diff

github-actions · 2024-10-09T16:33:55Z

Here are your diffs for this pull request

`admin-schema.json`

--- v3.0.1/admin-schema.json	2024-11-19 19:40:45.814307408 +0000
+++ v4.0.0/admin-schema.json	2024-11-19 19:40:47.062298932 +0000
@@ -1,6 +1,6 @@
 {
     "$schema": "https://json-schema.org/draft/2020-12/schema",
-    "$id": "https://raw.githubusercontent.com/hubverse-org/schemas/main/v3.0.1/admin-schema.json",
+    "$id": "https://raw.githubusercontent.com/hubverse-org/schemas/main/v4.0.0/admin-schema.json",
     "title": "Schema for Modeling Hub administrative settings",
     "description": "This JSON file provides a schema for modeling hub administrative information.",
     "type": "object",
@@ -59,9 +59,11 @@
                     ]
                 },
                 "owner": {
+                    "description": "The hub repository owner (user or organisation).",
                     "type": "string"
                 },
                 "name": {
+                    "description": "The name of the hub repository.",
                     "type": "string"
                 }
             }

`tasks-schema.json`

--- v3.0.1/tasks-schema.json	2024-11-19 19:40:45.814307408 +0000
+++ v4.0.0/tasks-schema.json	2024-11-19 19:40:47.062298932 +0000
@@ -1,6 +1,6 @@
 {
     "$schema": "https://json-schema.org/draft/2020-12/schema",
-    "$id": "https://raw.githubusercontent.com/hubverse-org/schemas/main/v3.0.1/tasks-schema.json",
+    "$id": "https://raw.githubusercontent.com/hubverse-org/schemas/main/v4.0.0/tasks-schema.json",
     "title": "Schema for Modeling Hub model task definitions",
     "description": "This is the schema of the tasks.json configuration file that defines the tasks within a modeling hub.",
     "type": "object",
@@ -92,7 +92,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "forecast_date": {
                                             "description": "An object containing arrays of required and optional unique forecast dates. Forecast date usually defines the date that a model is run to produce a forecast.",
@@ -136,7 +137,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "scenario_id": {
                                             "description": "An object containing arrays of required and optional unique identifiers of each valid scenario.",
@@ -194,13 +196,16 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "location": {
                                             "description": "An object containing arrays of required and optional unique identifiers for each valid location, e.g. country codes, FIPS state or county level code etc.",
                                             "examples": [
                                                 {
-                                                    "required": "US",
+                                                    "required": [
+                                                        "US"
+                                                    ],
                                                     "optional": [
                                                         "01",
                                                         "02",
@@ -284,7 +289,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "target": {
                                             "description": "An object containing arrays of required and optional unique identifiers for each valid target. Usually represents a single task ID target key variable.",
@@ -332,7 +338,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "target_variable": {
                                             "description": "An object containing arrays of required and optional unique identifiers for each valid target variable. Usually forms part of a pair of task ID target key variables (along with target_outcome) which combine to define individual targets.",
@@ -382,7 +389,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "target_outcome": {
                                             "description": "An object containing arrays of required and optional unique identifiers for each valid target outcome. Usually forms part of a pair of task ID target key variables (along with target_variable) which combine to define individual targets.",
@@ -430,7 +438,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "target_date": {
                                             "description": "An object containing arrays of required and optional unique target dates. For short-term forecasts, the target_date specifies the date of occurrence of the outcome of interest. For instance, if models are requested to forecast the number of hospitalizations that will occur on 2022-07-15, the target_date is 2022-07-15",
@@ -474,7 +483,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "target_end_date": {
                                             "description": "An object containing arrays of required and optional unique target end dates. For short-term forecasts, the target_end_date specifies the date of occurrence of the outcome of interest. For instance, if models are requested to forecast the number of hospitalizations that will occur on 2022-07-15, the target_end_date is 2022-07-15",
@@ -518,7 +528,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "horizon": {
                                             "description": "An object containing arrays of required and optional unique horizons. Horizons define the difference between the target_date and the origin_date in time units specified by the hub (e.g., may be days, weeks, or months)",
@@ -567,7 +578,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "age_group": {
                                             "type": "object",
@@ -611,7 +623,8 @@
                                             "required": [
                                                 "required",
                                                 "optional"
-                                            ]
+                                            ],
+                                            "additionalProperties": false
                                         }
                                     },
                                     "additionalProperties": {
@@ -638,7 +651,8 @@
                                         "required": [
                                             "required",
                                             "optional"
-                                        ]
+                                        ],
+                                        "additionalProperties": false
                                     }
                                 },
                                 "output_type": {
@@ -650,60 +664,23 @@
                                             "description": "Object defining the mean of the predictive distribution output type.",
                                             "properties": {
                                                 "output_type_id": {
-                                                    "description": "output_type_id is not meaningful for a mean output_type. The property is primarily used to determine whether mean is a required or optional output type through properties required and optional. If mean is a required output type, the required property must be an array containing the single string element 'NA' and the optional property must be set to null. If mean is an optional output type, the optional property must be an array containing the single string element 'NA' and the required property must be set to null",
+                                                    "description": "output_type_id is not meaningful for a point estimate output_type. Must have a single property named 'required' with the value null.",
                                                     "examples": [
                                                         {
-                                                            "required": [
-                                                                "NA"
-                                                            ],
-                                                            "optional": null
-                                                        },
-                                                        {
-                                                            "required": null,
-                                                            "optional": [
-                                                                "NA"
-                                                            ]
+                                                            "required": null
                                                         }
                                                     ],
                                                     "type": "object",
-                                                    "oneOf": [
-                                                        {
-                                                            "properties": {
-                                                                "required": {
-                                                                    "description": "When mean is required, property set to single element 'NA' array",
-                                                                    "type": "array",
-                                                                    "items": {
-                                                                        "const": "NA",
-                                                                        "maxItems": 1
-                                                                    }
-                                                                },
-                                                                "optional": {
-                                                                    "description": "When mean is required, property set to null",
-                                                                    "type": "null"
-                                                                }
-                                                            }
-                                                        },
-                                                        {
-                                                            "properties": {
-                                                                "required": {
-                                                                    "description": "When mean is optional, property set to null",
-                                                                    "type": "null"
-                                                                },
-                                                                "optional": {
-                                                                    "description": "When mean is optional, property set to single element 'NA' array",
-                                                                    "type": "array",
-                                                                    "items": {
-                                                                        "const": "NA",
-                                                                        "maxItems": 1
-                                                                    }
-                                                                }
-                                                            }
+                                                    "properties": {
+                                                        "required": {
+                                                            "description": "Not relevant for point estimate output types. Must be null.",
+                                                            "type": "null"
                                                         }
-                                                    ],
+                                                    },
                                                     "required": [
-                                                        "required",
-                                                        "optional"
-                                                    ]
+                                                        "required"
+                                                    ],
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
@@ -740,77 +717,55 @@
                                                     },
                                                     "required": [
                                                         "type"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "median": {
                                             "type": "object",
                                             "description": "Object defining the median of the predictive distribution output type",
                                             "properties": {
                                                 "output_type_id": {
-                                                    "description": "output_type_id is not meaningful for a median output_type. The property is primarily used to determine whether median is a required or optional output type through properties required and optional. If median is a required output type, the required property must be an array containing the single string element 'NA' and the optional property must be set to null. If median is an optional output type, the optional property must be an array containing the single string element 'NA' and the required property must be set to null",
+                                                    "description": "output_type_id is not meaningful for a point estimate output_type. Must have a single property named 'required' with the value null.",
                                                     "examples": [
                                                         {
-                                                            "required": [
-                                                                "NA"
-                                                            ],
-                                                            "optional": null
-                                                        },
-                                                        {
-                                                            "required": null,
-                                                            "optional": [
-                                                                "NA"
-                                                            ]
+                                                            "required": null
                                                         }
                                                     ],
                                                     "type": "object",
-                                                    "oneOf": [
-                                                        {
-                                                            "properties": {
-                                                                "required": {
-                                                                    "description": "When median is required, property set to single element 'NA' array",
-                                                                    "type": "array",
-                                                                    "items": {
-                                                                        "const": "NA",
-                                                                        "maxItems": 1
-                                                                    }
-                                                                },
-                                                                "optional": {
-                                                                    "description": "When median is required, property set to null",
-                                                                    "type": "null"
-                                                                }
-                                                            }
-                                                        },
-                                                        {
-                                                            "properties": {
-                                                                "required": {
-                                                                    "description": "When median is optional, property set to null",
-                                                                    "type": "null"
-                                                                },
-                                                                "optional": {
-                                                                    "description": "When median is optional, property set to single element 'NA' array",
-                                                                    "type": "array",
-                                                                    "items": {
-                                                                        "const": "NA",
-                                                                        "maxItems": 1
-                                                                    }
-                                                                }
-                                                            }
+                                                    "properties": {
+                                                        "required": {
+                                                            "description": "Not relevant for point estimate output types. Must be null.",
+                                                            "type": "null"
                                                         }
-                                                    ],
+                                                    },
                                                     "required": [
-                                                        "required",
-                                                        "optional"
-                                                    ]
+                                                        "required"
+                                                    ],
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
-                                                    "description": "Object defining the characteristics of valid median values",
+                                                    "description": "Object defining the characteristics of valid median values.",
                                                     "examples": [
                                                         {
                                                             "type": "double",
@@ -819,7 +774,7 @@
                                                     ],
                                                     "properties": {
                                                         "type": {
-                                                            "description": "Data type of median values",
+                                                            "description": "Data type of median values.",
                                                             "type": "string",
                                                             "enum": [
                                                                 "double",
@@ -843,34 +798,47 @@
                                                     },
                                                     "required": [
                                                         "type"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "quantile": {
                                             "description": "Object defining the quantiles of the predictive distribution output type.",
                                             "type": "object",
                                             "properties": {
                                                 "output_type_id": {
-                                                    "description": "Object containing required and optional arrays defining the probability levels at which quantiles of the predictive distribution will be recorded.",
+                                                    "description": "Object containing arrays of required probability levels at which quantiles of the predictive distribution will be recorded.",
                                                     "examples": [
                                                         {
                                                             "required": [
-                                                                0.25,
-                                                                0.5,
-                                                                0.75
-                                                            ],
-                                                            "optional": [
                                                                 0.1,
                                                                 0.2,
+                                                                0.25,
                                                                 0.3,
                                                                 0.4,
+                                                                0.5,
                                                                 0.6,
                                                                 0.7,
+                                                                0.75,
                                                                 0.8,
                                                                 0.9
                                                             ]
@@ -879,24 +847,8 @@
                                                     "type": "object",
                                                     "properties": {
                                                         "required": {
-                                                            "description": "Array of unique probability levels between 0 and 1 that must be present for submission to be valid. Can be null if no probability levels are required and all valid probability levels are specified in the optional property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
-                                                            "uniqueItems": true,
-                                                            "items": {
-                                                                "type": "number",
-                                                                "minimum": 0,
-                                                                "maximum": 1
-                                                            }
-                                                        },
-                                                        "optional": {
-                                                            "description": "Array of valid but not required unique probability levels. Can be null if all probability levels are required and are specified in the required property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
+                                                            "description": "Array of unique probability levels between 0 and 1 inclusive that must be present for submission to be valid.",
+                                                            "type": "array",
                                                             "uniqueItems": true,
                                                             "items": {
                                                                 "type": "number",
@@ -906,9 +858,9 @@
                                                         }
                                                     },
                                                     "required": [
-                                                        "required",
-                                                        "optional"
-                                                    ]
+                                                        "required"
+                                                    ],
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
@@ -945,27 +897,41 @@
                                                     },
                                                     "required": [
                                                         "type"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "cdf": {
                                             "description": "Object defining the cumulative distribution function of the predictive distribution output type.",
                                             "type": "object",
                                             "properties": {
                                                 "output_type_id": {
-                                                    "description": "Object containing required and optional arrays defining possible values of the target variable at which values of the cumulative distribution function of the predictive distribution will be recorded. These should be listed in order from low to high.",
+                                                    "description": "Object containing required arrays defining possible values of the target variable at which values of the cumulative distribution function of the predictive distribution will be recorded. These should be listed in order from low to high.",
                                                     "examples": [
                                                         {
                                                             "required": [
                                                                 10,
                                                                 20
-                                                            ],
-                                                            "optional": null
+                                                            ]
                                                         },
                                                         {
                                                             "required": [
@@ -977,40 +943,14 @@
                                                                 "EW202245",
                                                                 "EW202246",
                                                                 "EW202247"
-                                                            ],
-                                                            "optional": null
+                                                            ]
                                                         }
                                                     ],
                                                     "type": "object",
                                                     "properties": {
                                                         "required": {
-                                                            "description": "Array of unique target values that must be present for submission to be valid. Can be null if no target values are required and all valid target values are specified in the optional property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
-                                                            "uniqueItems": true,
-                                                            "items": {
-                                                                "oneOf": [
-                                                                    {
-                                                                        "type": [
-                                                                            "number",
-                                                                            "integer"
-                                                                        ],
-                                                                        "minimum": 0
-                                                                    },
-                                                                    {
-                                                                        "type": "string"
-                                                                    }
-                                                                ]
-                                                            }
-                                                        },
-                                                        "optional": {
-                                                            "description": "Array of valid but not required unique target values. Can be null if all target values are required and are specified in the required property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
+                                                            "description": "Array of unique target values that must be present for submission to be valid.",
+                                                            "type": "array",
                                                             "uniqueItems": true,
                                                             "items": {
                                                                 "oneOf": [
@@ -1018,8 +958,7 @@
                                                                         "type": [
                                                                             "number",
                                                                             "integer"
-                                                                        ],
-                                                                        "minimum": 0
+                                                                        ]
                                                                     },
                                                                     {
                                                                         "type": "string"
@@ -1029,9 +968,9 @@
                                                         }
                                                     },
                                                     "required": [
-                                                        "required",
-                                                        "optional"
-                                                    ]
+                                                        "required"
+                                                    ],
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
@@ -1057,24 +996,38 @@
                                                         "type",
                                                         "minimum",
                                                         "maximum"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "pmf": {
                                             "description": "Object defining a probability mass function for a discrete variable output type. Includes nominal, binary and ordinal variable types.",
                                             "type": "object",
                                             "properties": {
                                                 "output_type_id": {
-                                                    "description": "Object containing required and optional arrays specifying valid categories of a discrete variable. Note that for ordinal variables, the category levels should be listed in order from low to high.",
+                                                    "description": "Object containing arrays of required values specifying valid categories of a discrete variable. Note that for ordinal variables, the category levels should be listed in order from low to high.",
                                                     "examples": [
                                                         {
-                                                            "required": null,
-                                                            "optional": [
+                                                            "required": [
                                                                 "low",
                                                                 "moderate",
                                                                 "high",
@@ -1085,22 +1038,8 @@
                                                     "type": "object",
                                                     "properties": {
                                                         "required": {
-                                                            "description": "Array of unique categories of a discrete variable that must be present for submission to be valid. Can be null if no categories are required and all valid categories are specified in the optional property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
-                                                            "uniqueItems": true,
-                                                            "items": {
-                                                                "type": "string"
-                                                            }
-                                                        },
-                                                        "optional": {
-                                                            "description": "Array of valid but not required unique categories of a discrete variable. Can be null if all categories are required and are specified in the required property.",
-                                                            "type": [
-                                                                "array",
-                                                                "null"
-                                                            ],
+                                                            "description": "Array of unique categories of a discrete variable that must be present for submission to be valid.",
+                                                            "type": "array",
                                                             "uniqueItems": true,
                                                             "items": {
                                                                 "type": "string"
@@ -1108,9 +1047,9 @@
                                                         }
                                                     },
                                                     "required": [
-                                                        "required",
-                                                        "optional"
-                                                    ]
+                                                        "required"
+                                                    ],
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
@@ -1140,13 +1079,28 @@
                                                         "type",
                                                         "minimum",
                                                         "maximum"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         },
                                         "sample": {
                                             "description": "Object defining a sample output type.",
@@ -1157,7 +1111,6 @@
                                                     "examples": [
                                                         {
                                                             "output_type_id_params": {
-                                                                "is_required": true,
                                                                 "type": "integer",
                                                                 "min_samples_per_task": 100,
                                                                 "max_samples_per_task": 100
@@ -1165,7 +1118,6 @@
                                                         },
                                                         {
                                                             "output_type_id_params": {
-                                                                "is_required": false,
                                                                 "type": "character",
                                                                 "max_length": 6,
                                                                 "min_samples_per_task": 100,
@@ -1181,10 +1133,6 @@
                                                     ],
                                                     "type": "object",
                                                     "properties": {
-                                                        "is_required": {
-                                                            "description": "Boolean. Whether inclusion of samples is required for the submission to be valid",
-                                                            "type": "boolean"
-                                                        },
                                                         "type": {
                                                             "description": "Data type of sample indices.",
                                                             "type": "string",
@@ -1220,7 +1168,6 @@
                                                         }
                                                     },
                                                     "required": [
-                                                        "is_required",
                                                         "type",
                                                         "min_samples_per_task",
                                                         "max_samples_per_task"
@@ -1236,7 +1183,8 @@
                                                         "required": [
                                                             "max_length"
                                                         ]
-                                                    }
+                                                    },
+                                                    "additionalProperties": false
                                                 },
                                                 "value": {
                                                     "type": "object",
@@ -1272,13 +1220,28 @@
                                                     },
                                                     "required": [
                                                         "type"
-                                                    ]
+                                                    ],
+                                                    "additionalProperties": false
+                                                },
+                                                "is_required": {
+                                                    "description": "Is output type required? When required, property should be set to 'true'. If output type is optional, set to 'false'.",
+                                                    "examples": [
+                                                        {
+                                                            "is_required": true
+                                                        },
+                                                        {
+                                                            "is_required": false
+                                                        }
+                                                    ],
+                                                    "type": "boolean"
                                                 }
                                             },
                                             "required": [
                                                 "output_type_id_params",
-                                                "value"
-                                            ]
+                                                "value",
+                                                "is_required"
+                                            ],
+                                            "additionalProperties": false
                                         }
                                     },
                                     "additionalProperties": false
@@ -1340,7 +1303,10 @@
                                                 "type": [
                                                     "object",
                                                     "null"
-                                                ]
+                                                ],
+                                                "additionalProperties": {
+                                                    "type": "string"
+                                                }
                                             },
                                             "description": {
                                                 "description": "a verbose description of the target that might include information such as the target_measure above, or definitions of a 'rate' or similar.",
@@ -1412,7 +1378,8 @@
                                 "task_ids",
                                 "output_type",
                                 "target_metadata"
-                            ]
+                            ],
+                            "additionalProperties": false
                         }
                     },
                     "submissions_due": {
@@ -1449,7 +1416,8 @@
                                     "relative_to",
                                     "start",
                                     "end"
-                                ]
+                                ],
+                                "additionalProperties": false
                             },
                             {
                                 "properties": {
@@ -1467,7 +1435,8 @@
                                 "required": [
                                     "start",
                                     "end"
-                                ]
+                                ],
+                                "additionalProperties": false
                             }
                         ],
                         "required": [
@@ -1503,6 +1472,22 @@
                                 "arrow"
                             ]
                         }
+                    },
+                    "derived_task_ids": {
+                        "description": "Names of derived task IDs, i.e. task IDs whose values are derived from (and therefore dependent on) the values of other variables. Use this property to override the global setting for a specific round.",
+                        "examples": [
+                            [
+                                "target_end_date"
+                            ]
+                        ],
+                        "type": [
+                            "array",
+                            "null"
+                        ],
+                        "uniqueItems": true,
+                        "items": {
+                            "type": "string"
+                        }
                     }
                 },
                 "required": [
@@ -1529,6 +1514,22 @@
                 "logical",
                 "Date"
             ]
+        },
+        "derived_task_ids": {
+            "description": "Names of derived task IDs, i.e. task IDs whose values are derived from (and therefore dependent on) the values of other variables.",
+            "examples": [
+                [
+                    "target_end_date"
+                ]
+            ],
+            "type": [
+                "array",
+                "null"
+            ],
+            "uniqueItems": true,
+            "items": {
+                "type": "string"
+            }
         }
     },
     "required": [

zkamvar · 2024-10-09T16:43:11Z

Note I did manage to re-indent with jq but it expanded all arrays to multiple rows too with no clear way how to avoid that.

I think we should just bend to the formatter. One thing we could do to avoid this is to run 3.0.1 through the formatter in this PR as well so that the diff appears the same. The formatter will not change any structural information of the JSON, just its format.

From there, we can add guidance for future additions to run it through the formatter.

LucieContamin

Thank you for the update, I just added a question about the derived_tasks_ids.

LucieContamin · 2024-10-09T14:32:48Z

v4.0.0/tasks-schema.json

+    "derived_task_ids": {
+        "description": "Names of derived task IDs, i.e. task IDs whose values are derived from (and therefore dependent on) the values of other variables.",
+        "type": [
+            "array",
+            "null"
+        ],
+        "uniqueItems": true,
+        "items": {
+            "type": "string"
+        }
+    },


I might be wrong here but should it not be at the "round" level, as depending on the round, you might have so really different task IDs ? (or is it already?)

I feel like it's a hub property overall in that I can't see many situations where a task ID is derived in one round and not derived in another. The chances of it being stable are far greater than the property changing by round so it would be annoying to have to re-define it again in every round.

I'll make sure however that if a new derived task ID is added to a new round and the task id name added to derived_task_ids, it will not affect validation of older rounds, i.e. if the task ID is not present in the data, derived_task_ids will just be ignored.

The is the potential to allow for a derived_task_ids property at the round level that overrides the overall property but I think we can wait and see if anyone requests that as it feels like an extreme edge case?

Thank you for the additional information.

You make a really good point, however I am worrying a bit about the scenario hubs. They tend to have lot of rounds and have different tasks id for some round as for the need of a specific round or scenario you might need to create new column that are not replicated in the next round.
I am not sure how frequent this new columns will be tagged as derived tasks ids, but even saying that it looks to me a little bit weird to have the tasks id as hub level as they are not define at hub level but at round level.

Interesting. The problem though does not arise from a derived task id not being in all rounds. The problem is if say you create a task id as a derived taks id and then later one change that (i.e. the task ID is no longer derived. That's the only situation where this could be problematic but seems to me not good practice within a hub.

I definitely don't want everyone to have to respecify this property at each round as for the majority of hubs this is very stable so we will for sure keep the overall high level specification. As mentioned there is the option for overriding at the round but I do not want to implement unless its actually necessary

The problem though does not arise from a derived task id not being in all rounds

I misunderstood and I was afraid that would be an issue but if not, that solve some of my problems! Apologize for that!
I agree that changing the behavior of a column is bad practice and should not be supported.

I definitely don't want everyone to have to respecify this property at each round as for the majority of hubs this is very stable so we will for sure keep the overall high level specification.

I also agree, that's to much to ask for something that might not be used often.
And the optional overriding seems like a good idea, and I think a hub can do their own wrapper/function to deal with it if necessary.

So to summarize, thank you for the additional information, I change my mind and I am ok with it being at hub level!

annakrystalli · 2024-10-10T08:34:45Z

I think we should just bend to the formatter. One thing we could do to avoid this is to run 3.0.1 through the formatter in this PR as well so that the diff appears the same. The formatter will not change any structural information of the JSON, just its format.

Good option! See #106

From there, we can add guidance for future additions to run it through the formatter.

I opened the following issue to discuss approach: #107

annakrystalli · 2024-10-10T14:18:09Z

/diff

annakrystalli · 2024-10-10T14:23:18Z

/diff

Merge branch 'main' into br-v4.0.0 # Please enter a commit message to explain why this merge is necessary, # especially if it merges an updated upstream into a topic branch. # # Lines starting with '#' will be ignored, and an empty message aborts # the commit.

…roperty

annakrystalli · 2024-10-11T15:16:30Z

Hey @LucieContamin ! I added a round level property now too so you can now use that to override the global property.

Glad you asked for this feature as going back to introduce it I found that the global derived_task_ids property was nested at the wrong level!! 🙈

annakrystalli · 2024-10-11T15:18:26Z

/diff

Resolves #97

Add `target_keys` property schema

zkamvar · 2024-10-16T20:16:44Z

/diff

zkamvar · 2024-10-18T23:13:59Z

point estimates

Something that was brought up in response to reichlab/variant-nowcast-hub#117 (comment) is that the "NA" is a bit confusing because it sure looks like a character, but when we expand the grid the output_type_id columns become NA (which is an intentional move by Ooms described in section 2.1.1 of the JSONlite package paper)

Now that we are using is_required for point estimate types, we might be able to take this opportunity to set the required property to a single element null array. This will have exactly the same result as the "NA" array, but with the following advantages:

inter-language compatibility can be achieved since null is a concept that even JSON can understand
we can clearly communicate that this should be a missing value in the data as opposed to a character string.

This is what I think it would look like in the schema:

"required": {
    "description": "Not relevant for point estimate output types. Must be a single array of null"
    "type": "array"
    "items": {
        "const": null,
        "maxItems": 1
    }
}

Demo

Here's a demo that shows that ["NA"] and [null] are equivalent by modifying a tasks.json file and reading them in with jsonlite

hub_con <- hubData::connect_hub(
  system.file("testhubs/simple", package = "hubUtils")
)
hp <- attributes(hub_con)$hub_path
# Get the tasks file which has a `mean` output type
tasks <- fs::path(hp, "hub-config", "tasks.json")

# copy the file to a new temp file
tmp <- withr::local_tempfile()
fs::file_copy(tasks, tmp, overwrite = TRUE)
# The two files are identical
unname(tools::md5sum(tmp) == tools::md5sum(tasks))
#> [1] TRUE

cfg <- readLines(tmp)
writeLines(cfg[231:240]) # optional is NA
#>                 "output_type": {
#>                     "mean": {
#>                         "output_type_id": {
#>                             "required": null,
#>                             "optional": ["NA"]
#>                         },
#>                         "value": {
#>                             "type": "integer",
#>                             "minimum": 0
#>                         }

# Change '["NA"]' to '[null]'
nullcfg <- sub('["NA"]', '[null]', cfg, fixed = TRUE)
writeLines(nullcfg[231:240]) # optional is now null
#>                 "output_type": {
#>                     "mean": {
#>                         "output_type_id": {
#>                             "required": null,
#>                             "optional": [null]
#>                         },
#>                         "value": {
#>                             "type": "integer",
#>                             "minimum": 0
#>                         }
writeLines(nullcfg, con = tmp) # changing to null

# The two files are no longer identical
unname(tools::md5sum(tmp) == tools::md5sum(tasks))
#> [1] FALSE
waldo::compare(cfg, nullcfg)
#> old[81:87] vs new[81:87]
#>   "                    \"mean\": {"
#>   "                        \"output_type_id\": {"
#>   "                            \"required\": null,"
#> - "                            \"optional\": [\"NA\"]"
#> + "                            \"optional\": [null]"
#>   "                        },"
#>   "                        \"value\": {"
#>   "                            \"type\": \"integer\","
#> 
#> old[232:238] vs new[232:238]
#>   "                    \"mean\": {"
#>   "                        \"output_type_id\": {"
#>   "                            \"required\": null,"
#> - "                            \"optional\": [\"NA\"]"
#> + "                            \"optional\": [null]"
#>   "                        },"
#>   "                        \"value\": {"
#>   "                            \"type\": \"integer\","


# The resulting R object after reading in JSON are identical
identical(
  jsonlite::fromJSON(tasks, simplifyDataFrame = FALSE), 
  jsonlite::fromJSON(tmp, simplifyDataFrame = FALSE)
)
#> [1] TRUE

^{Created on 2024-10-18 with reprex v2.1.1}

Co-authored-by: Evan Ray <[email protected]>

…ut-type-ids/109 Encode point estimate output type IDs with null.

…esolves #113

…/114 v4 - Do not allow additional properties in lower level objects

zkamvar · 2024-11-19T19:40:30Z

/diff

annakrystalli added 7 commits October 7, 2024 10:07

Update new version branch naming instruction to use br- prefix

527258f

Create v4.0.0 directory

a813ebb

Add derived_task_ids property. Resolves #96

c25f08d

Disallow optional output type ids + introduce 'is_required' property. R…

4d2e935

…esolves #99, #100, #101, #102

Add specifics about repository object. Resolves #98

20d95d2

Update NEWS

2eb7a8e

Correct issue number

a0aa58e

annakrystalli requested review from nickreich and elray1 October 9, 2024 10:21

annakrystalli force-pushed the br-v4.0.0 branch from 05a50f1 to a0aa58e Compare October 9, 2024 12:03

Correct loaction example which should be array

c3a61d7

LucieContamin self-requested a review October 9, 2024 12:30

Fix indents with jq

65d91de

zkamvar added a commit that referenced this pull request Oct 9, 2024

add read all permissions to diff commenter

32afe6b

We were having trouble commenting the diff in #103 and have no clue why it's not working. Maybe this will fix it?

zkamvar mentioned this pull request Oct 9, 2024

add read all permissions to diff commenter #104

Merged

hubverse-org deleted a comment from annakrystalli Oct 9, 2024

LucieContamin reviewed Oct 9, 2024

View reviewed changes

This was referenced Oct 10, 2024

Use jq to un-collapse arrays #106

Merged

Ensure schema files are styled consistently between versions #107

Open

annakrystalli added 4 commits October 10, 2024 17:32

merge main into br-v4.0.0

938c6c2

Merge branch 'main' into br-v4.0.0 # Please enter a commit message to explain why this merge is necessary, # especially if it merges an updated upstream into a topic branch. # # Lines starting with '#' will be ignored, and an empty message aborts # the commit.

Fix typo

6bf0253

Correct derived_task_ids location. Add round level derived_task_ids p…

d29c9b8

…roperty

Add round level derived_task_ids explanation

9134635

annakrystalli requested a review from LucieContamin October 11, 2024 15:16

annakrystalli added 2 commits October 16, 2024 15:07

use additionalProperties to enforce target_keys properties are strings.

7c1fa0e

Resolves #97

Merge pull request #108 from hubverse-org/ak/target_keys-schema/97

392cfdc

Add `target_keys` property schema

annakrystalli mentioned this pull request Oct 21, 2024

Null point estimate output type IDs in configs instead of NAs #109

Closed

annakrystalli and others added 8 commits October 25, 2024 10:51

Encode point estimate output type IDs with null. Resolves #109

bfabc6d

Update v4.0.0/tasks-schema.json

de81fd1

Co-authored-by: Evan Ray <[email protected]>

Update v4.0.0/tasks-schema.json

7543aa7

Co-authored-by: Evan Ray <[email protected]>

Merge pull request #111 from hubverse-org/ak/null-point-estimate-outp…

47beca8

…ut-type-ids/109 Encode point estimate output type IDs with null.

Set additionalProperties to false on lower level objects. Resolvse #114

1f42561

Remove 0 minimum value requirement for cdf numeric output type IDs. R…

1a4ee10

…esolves #113

Update NEWS

bc23e4a

Merge pull request #115 from hubverse-org/ak/v4/additional-properties…

6718e85

…/114 v4 - Do not allow additional properties in lower level objects

annakrystalli merged commit 0163a89 into main Nov 25, 2024
3 checks passed

annakrystalli deleted the br-v4.0.0 branch January 17, 2025 15:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Br v4.0.0 #103

Br v4.0.0 #103

annakrystalli commented Oct 9, 2024 •

edited

Loading

annakrystalli commented Oct 9, 2024

zkamvar commented Oct 9, 2024

github-actions bot commented Oct 9, 2024 •

edited

Loading

zkamvar commented Oct 9, 2024

LucieContamin left a comment

LucieContamin Oct 9, 2024

annakrystalli Oct 10, 2024

LucieContamin Oct 10, 2024

annakrystalli Oct 10, 2024 •

edited

Loading

annakrystalli Oct 10, 2024 •

edited

Loading

LucieContamin Oct 10, 2024

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 11, 2024

annakrystalli commented Oct 11, 2024

zkamvar commented Oct 16, 2024

zkamvar commented Oct 18, 2024

zkamvar commented Nov 19, 2024

Br v4.0.0 #103

Br v4.0.0 #103

Conversation

annakrystalli commented Oct 9, 2024 • edited Loading

annakrystalli commented Oct 9, 2024

zkamvar commented Oct 9, 2024

github-actions bot commented Oct 9, 2024 • edited Loading

admin-schema.json

tasks-schema.json

zkamvar commented Oct 9, 2024

LucieContamin left a comment

Choose a reason for hiding this comment

LucieContamin Oct 9, 2024

Choose a reason for hiding this comment

annakrystalli Oct 10, 2024

Choose a reason for hiding this comment

LucieContamin Oct 10, 2024

Choose a reason for hiding this comment

annakrystalli Oct 10, 2024 • edited Loading

Choose a reason for hiding this comment

annakrystalli Oct 10, 2024 • edited Loading

Choose a reason for hiding this comment

LucieContamin Oct 10, 2024

Choose a reason for hiding this comment

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 10, 2024

annakrystalli commented Oct 11, 2024

annakrystalli commented Oct 11, 2024

zkamvar commented Oct 16, 2024

zkamvar commented Oct 18, 2024

point estimates

Demo

zkamvar commented Nov 19, 2024

annakrystalli commented Oct 9, 2024 •

edited

Loading

github-actions bot commented Oct 9, 2024 •

edited

Loading

`admin-schema.json`

`tasks-schema.json`

annakrystalli Oct 10, 2024 •

edited

Loading

annakrystalli Oct 10, 2024 •

edited

Loading