test the pipelines data #206
Comments
Hello @OriHoch, I hope you're doing well! I came across this issue and I'd like to contribute by helping to refine the task and make it more feasible. To better understand the scope and specific requirements of this task, I have a few questions that will help us determine the best approach:
Lastly, I'd like to suggest breaking down this task into several subtasks, which can help make the process more manageable and allow for better tracking of progress. Here are a few examples of subtasks:
Please let me know if these questions and suggestions resonate with your vision for the task or if you have any additional thoughts or concerns. I look forward to your response and working together to improve the quality of the data generated by this pipeline.
Everything is defined in the pipeline YAMLs; most of the data comes from the Knesset APIs. For example this yaml. In it you will see the first pipeline -
We haven't defined any specific metrics, but all of those are important. The most important, I guess, is accuracy.
You are free to explore and suggest.
All the data is public and available in SQL via Redash and CSV files, how to access it is described in the website homepage - https://oknesset.org/
We don't have any definitions, but you can see which data is more interesting / important by looking at the user surveys and site specs which are linked to in this issue. Also, it's worth talking with Assaf Shapira, who has some ideas regarding what to do with the data and how to analyze it.
A successful outcome would be to know that some part of the data (e.g. committees data) is accurate and complete, or to have bugs opened for the parts that are not.
I invited you to the organization, so you should have permissions to open issues. Feel free to open issues for subtasks.
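As a possible starting point for the "accurate and complete" goal above, here is a minimal sketch of a completeness check over a CSV export. It is only an illustration: the sample rows, the column names (`committee_id`, `name`, `start_date`) and the `check_completeness` helper are all hypothetical and do not reflect the real schema, so they would need to be adapted to the actual committees CSV.

```python
import csv
import io

# Hypothetical sample standing in for a committees CSV export.
# The column names are assumptions, not the real schema.
SAMPLE_CSV = """committee_id,name,start_date
1,Finance,2015-05-01
2,,2015-05-01
2,Education,
3,Economy,2015-06-10
"""


def check_completeness(rows, key, required):
    """Return (sorted duplicate key values, {column: count of empty values})."""
    seen, duplicates = set(), set()
    missing = {col: 0 for col in required}
    for row in rows:
        value = row[key]
        if value in seen:
            duplicates.add(value)
        seen.add(value)
        for col in required:
            # Treat absent, empty, and whitespace-only cells as missing.
            if not (row.get(col) or "").strip():
                missing[col] += 1
    return sorted(duplicates), missing


rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
duplicates, missing = check_completeness(
    rows, "committee_id", ["name", "start_date"]
)
print("duplicate ids:", duplicates)
print("missing values per column:", missing)
```

A check like this only covers completeness (duplicates and empty fields). Accuracy would still need a comparison against the source Knesset API data.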
Sounds good, it would be really useful to have someone define and apply methodologies which will ensure the quality of our data!
I'm assigning the issue to you. That doesn't mean you necessarily have to implement everything, but I think it would be good if you could centralize the efforts for it and direct other developers who might want to help.
We generate a lot of data, but is the data even any good? We need to test it.