-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
user requirements related to "self data" #364
Comments
Couple of quick comments for particular requirements
@jf87 have you seen ML work that addresses relaxing this requirement? I will also do a quick search and see if I can find something. I think that there has been some work on detecting changes are recomputing only related results. I have opened #366 for the more detailed discussion. |
Yes, the expectation is that every project can have its own consent terms; the consent terms go into the @ipsita0012 had some thoughts from their deployment. |
This is definitely the goal and the framework has been used for travel surveys, behavior change modification and crowdsourcing (hopefully launching today). Do you have concrete suggestions on how to make it more generic? As the use cases submit their changes, the need for a plugin-based architecture for both the phone and the server has become increasingly clear. Is that what you had in mind? |
This is already true at a conceptual level. The recommended way to access e-mission data is through the timeseries interface The timeseries interface has two options - you can get data for an individual user ( As long as all code follows these conceptual guidelines, the rest of it is implementation detail. Although I don't think that having a separate aggregate DB is flexible enough (which ranges would you use for the stored aggregations?), you could certainly cache some results in a separate aggregate database as long as everybody followed the abstractions. And of course, we could choose to switch to a dedicated timeseries database, or give people a choice of databases (e.g. use embedded SQLite for lighter-weight deployments) in the future. The abstractions are what is important. The database is implementation. People should not box themselves into a corner by using the implementation. [1] I have no idea why every project just wants to access the database directly instead of using the recommended methods in the |
Yup, will be in the new architecture! |
The initial implementation of the architecture will use docker without graphene. But using docker without graphene has serious limitations if the cloud provider is compromised, so secure execution is really the long-term goal and solution. I am sure it is possible to implement an ad-hoc crowdsourcing solution for certain specific kinds of analysis (e.g. only automobile speeds as in OpenTraffic) but that is not very interesting to me, because it does not fit my overall vision of longitudinal collection of end to end data across all modes. It seems like you would not even need to store data in that case. You can theoretically compute average speed directly on the phone and send it to a server, but you would need to ensure that sequences of speeds from the same user cannot be correlated. There's been prior work done on that; IIRC vPriv https://people.eecs.berkeley.edu/~raluca/vpriv.pdf uses that model. I am not aware of an open source implementation, though. I don't think any of the work out of Hari's lab at MIT is open source. |
Also, wrt crowdsourcing, this article made a big splash when it came out |
Thanks for your remarks!
|
there are also documents in French, but this document in English from Uk about personal data might interest you: |
for the record i put here these requirements related to "self data"
They can be linked to current work on new architecture with built-in encryption for addressing privacy.
See https://github.com/njriasan/e-mission-docs/blob/master/docs/future_work/NewArchitecture.md
and issue #330
The text was updated successfully, but these errors were encountered: