-
Notifications
You must be signed in to change notification settings - Fork 0
/
index.html
1762 lines (1227 loc) · 50.1 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
<!DOCTYPE html>
<html>
<head>
<title>Introduction to Data Analytics - NYC Parks</title>
<meta charset="utf-8">
<link rel="stylesheet" href="slide.css"/>
</head>
<body>
<textarea id="source">
layout:true
<div class="header" style="display:block; text-align: left; color:gray; font-size:1em; position: fixed; top: 0px; left: 0px; height: 30px;vertical-align:middle;margin:7px 0 0 0;width:100%;background: #fcfcfc;
background: -moz-linear-gradient(top, #fcfcfc 0%, #d1d1d1 100%);
background: -webkit-linear-gradient(top, #fcfcfc 0%,#d1d1d1 100%);
background: linear-gradient(to bottom, #fcfcfc 0%,#d1d1d1 100%);
filter: progid:DXImageTransform.Microsoft.gradient( startColorstr='#fcfcfc', endColorstr='#d1d1d1',GradientType=0 );">
<img src="images/datapolitan_transparent_small.png" style="position: fixed;top: 2px; left: 5px; width: 14%;">
<p style="display:block;text-align:left;color:gray;font-size:1em; position: fixed; top: 0px; left: 155px; height: 30px;vertical-align:middle;margin:7px 0 0 0; ">Introduction to Data Analytics - NYC Parks and Recreation</p>
</div>
<div class="footer" style="display:block;color:gray;position:fixed;bottom: 0px;left:0px;height:30px;vertical-align:middle;margin:0 0 0 0;width:100%;background: #fcfcfc;background: -moz-linear-gradient(top, #fcfcfc 0%, #d1d1d1 100%);background: -webkit-linear-gradient(top, #fcfcfc 0%,#d1d1d1 100%);background: linear-gradient(to bottom, #fcfcfc 0%,#d1d1d1 100%);sfilter: progid:DXImageTransform.Microsoft.gradient( startColorstr='#fcfcfc', endColorstr='#d1d1d1',GradientType=0 );">
<p class="footer">
<span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Introduction to Data Analytics</span> by <a xmlns:cc="http://creativecommons.org/ns#" href="http://www.datapolitan.com" target="_blank" property="cc:attributionName" rel="cc:attributionURL">Richard Dunks</a> and <a xmlns:cc="http://creativecommons.org/ns#" href="http://www.tinypanther.pizza" property="cc:attributionName" rel="cc:attributionURL">Julia Marden</a> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/">Creative Commons Attribution-ShareAlike 4.0 International License</a>.<a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"><img alt="Creative-Commons-License" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/80x15.png" /></a>
</p>
</div>
<!-- <p class="footer">
<span xmlns:dct="http://purl.org/dc/terms/" property="dct:title">Data Analytics for NYC Parks</span> by <a xmlns:cc="http://creativecommons.org/ns#" href="http://www.datapolitan.com" property="cc:attributionName" rel="cc:attributionURL">Richard Dunks</a> and <a xmlns:cc="http://creativecommons.org/ns#" href="http://www.tinypanther.pizza" property="cc:attributionName" rel="cc:attributionURL">Julia Marden</a> is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/">Creative Commons Attribution-ShareAlike 4.0 International License</a>.<a rel="license" href="http://creativecommons.org/licenses/by-sa/4.0/"><img alt="Creative-Commons-License" style="border-width:0" src="https://i.creativecommons.org/l/by-sa/4.0/80x15.png" /></a>
</p> -->
--
class:center, middle
![img-center-50](images/datapolitan.png)
# Introduction to Data Analytics<br>City of New York Parks and Recreation
- - -
## Instructors: Elizabeth DiLuzio + Gemma Duffee
### Follow along at: http://bit.ly/parks-intro-analytics
---
class:center,middle
# Welcome
## [Click to Skip to Afternoon Session](#lunch)
---
exclude:true
# Data Driven Culture
![img-center-95](images/ddc_compass.png)
???
+ A placeholder for a later discussion on data driven culture
+ Establishes the "ideological" orientation of the class
---
# A Few Ground Rules
???
+ Facilitators establish the intention we have for the culture of the classroom
--
+ Step up, step back
--
+ Be curious and ask questions!
--
+ Assume noble regard and positive intent
--
+ Respect multiple perspectives
--
+ Listen deeply
--
+ Be present (phone, email, social media, etc.)
---
# Introduce Yourself to Your Neighbor
+ Who are you?
+ Where do you work?
+ What has been the proudest moment in your job?
+ What's your favorite park?
???
+ An opportunity for participants to get to know each other and settle into the class
+ We bring them into a positive space related to their work
+ Facilitator reminds them the data isn't the end but the means to achieve the actions that make us feel good about our jobs
---
# What to Expect Today
+ 9:40 – Data Analytics 101
+ 10:00 - Introduction to Problem Ideation
+ 10:30 – 15 min break
+ 10:45 – Process Mapping
+ 12:00 – Lunch
+ 1:00 – Process Mapping Part 2
+ 2:30 – 15 min break
+ 2:45 – Data Analytics Exercise
+ 4:30 – Dismissal
???
+ Outline of activities for participants awareness
---
# Goals for This Class
--
???
+ Sets expectations for the students
+ Light comedy is intended to relax participants concerned about the content
--
+ Explore the elements of a data-driven culture
--
+ Learn about different types of analysis
--
+ Practice mapping out the analytics process
--
+ Get hands-on practice with Parks data
--
+ Helpful demonstration of key skills in Excel
--
+ Some great graphics (courtesy of [Julia Marden](http://tinypanther.com/) and the Internet)
--
+ A good, worthwhile day geeking out about data
---
# Key Things to Keep in Mind
???
+ Makes clear the key outcomes for the class and intentions for the project
+ Manage expectations for a heavily techical class
--
+ Our hope is to introduce important concepts and allow us to discuss them as a class
--
+ Our focus today is process over product
--
+ This is a chance to develop experience collaborating together on problems
--
+ Our ultimate goal is to help you leverage data to improve operations in your areas of responsibility
---
# Housekeeping
???
+ Facilitator sets expectations with the students
+ Establishes the "contract" for the class
--
+ We’ll have one 15 minute break in the morning
--
+ We’ll have an hour for lunch
--
+ We’ll have a 15 minute break in the afternoon
--
+ Class will start promptly after breaks
--
+ Feel free to use the bathroom if you need during class
--
+ Please take any phone conversations into the hall to not disrupt the class
---
exclude:true
# Data Driven Decisions Require Humans
![img-center-50](images/iphone_intel.jpg)
.caption[Image Credit: 100 lion, [CC BY-SA 4.0](http://creativecommons.org/licenses/by-sa/4.0), via [Wikimedia Commons](https://commons.wikimedia.org/wiki/File%3AIphone-62.jpg)]
## [How Intel lost out on the contract of a lifetime](http://appleinsider.com/articles/13/05/16/intel-chips-could-have-powered-first-iphone-ceo-otelini-says)
???
+ Facilitator reinforces point about human mental intermediation in using data
---
class:center, middle
# Let's talk data.
---
# The Value of Data
???
+ Facilitator prompts participants to reflect on the value of data in their work
+ Helpful to ask "how" when presented with reasons like "it tells us what happened" in order to get them thinking about the more intrinsic value of data
+ Also prompt to think about how they would manage something without data (experience, anecdote, etc.), particularly some process, office, effort with lots of elements or moving parts
--
+ Data tells a story about something that's happened
--
+ Can describe what happened directly or indirectly
--
![img-center-80](images/311data_parks.png)
???
+ Ex: "are the parks worst 10-10:59am? Why or why not?"
---
class: center,middle
# Are All Data Points Created Equal?
???
+ Facilitator prompts participants to think about data quality issues they experience and how it impacts their work
+ Also any other data issues they feel important to mention
---
class:middle
> Facts do not "speak for themselves." They speak for or against competing theories. Facts divorced from theory or visions are mere isolated curiosities.
## -Thomas Sowell *A Conflict of Visions*
???
+ Facilitator has participant read the quote and invites the class to reflect
+ Some will take this as talking about manipulation of data. It's more about the manipulation of context. The numbers aren't wrong, but are misused or not applied in the right context
---
exclude:true
# Being data-driven doesn't mean blindly following data
![img-center-80](images/blindlyfollowdata.jpg)
.caption[Image Credit: zhouxuan12345678, [CC BY-SA 2.0](https://creativecommons.org/licenses/by/2.0/), via [Flickr](https://www.flickr.com/photos/53921113@NO2/5453212152)]
---
exclude:true
class:middle
> Data is only as valuable as the decisions it enables.
## -[Ion Stoica](https://twitter.com/databricks/status/810190036624875520)
---
# What is Analysis?
???
+ Facilitator prompts participants to reflect on a definition for analysis
+ Facilitator reinforces the essential role of storytelling in good analysis
--
>“Analysis is simply the pursuit of understanding, usually through detailed inspection or comparison”
## - Carter Hewgley, Director of Analytics, [Center for Government Excellence](https://govex.jhu.edu/)
---
class:center, middle
# Data Analysis Should Drive Decision Making
#
##
---
# Data Analysis Should Drive Decision Making
![img-center-80](images/datadriven.png)
.caption[Image Credit: Data Driven School (https://www.slideshare.net/moshiiit/data-driven-school-mis-case-study)]
---
>“We move from data to information to knowledge to wisdom. And separating one from the other… knowing the limitations and the danger of exercising one without the others, while respecting each category of intelligence, is generally what serious education is about.”
## - Toni Morrison, author [The Source of Self-Regard](https://www.powells.com/book/-9780525521037?partnerID=44711)
---
class:center, middle
# Data Analysis Should Drive Decision Making
# This is what it means to be "data driven"
##
---
class:center, middle
# Data Analysis Should Drive Decision Making
# This is what it means to be "data driven"
## And good analysis should lead to good decisions
---
name:avp
# But It's Just One Part of a Process
![img-center-80](images/valuechain.png)
???
+ Facilitator introduces the idea of the Analytics Value Chain
---
class: middle
> "If you do not know how to ask the right question, you discover nothing."
## - W. Edward Deming
???
+ Sequeway to the brainstorming exercise (i.e. how do we learn to ask the right question)
---
name:brainstorm
# Our Method For Generating Ideas
???
+ Facilitator introduces the key method for brainstorming
--
![img-right-30](images/ideate.png)
+ **Ideate** - On your own, generate at least 3 ideas (ideally more), each on their own Post-It Note
####
![img-center-30](images/postit_ex.jpg)
---
# Our Method For Generating Ideas
![img-right-30](images/discuss.png)
+ **Ideate** - On your own, generate at least 3 ideas (ideally more), each on their own Post-It Note
+ **Discuss** - Review the ideas generated
---
# Our Method For Generating Ideas
![img-right-30](images/ideate_discuss_decide_crop.png)
+ **Ideate** - On your own, generate at least 3 ideas (ideally more), each on their own Post-It Note
+ **Discuss** - Review the ideas generated
+ **Decide** - Come to a consensus as a group
---
# Group Exercise - [NYC Vision Zero](http://www1.nyc.gov/site/visionzero/index.page)
???
## Task: Students practice developing key questions around an analytical task unrelated to NYC Parks to encourage broad participation and engagement
## Outcome: Students gain experience formulating and defining an analytical question to be explored in data not directly connected to the work they generally perform
## Format: Individual, small group, large group facilitated ideation, co-creation, and discussion
## Start by asking participants: what do you know about vision zero?
## Being reading slides by setting the scene: As the best analysts in NYC, we've all been recruited by the mayor into the vision zero task force.
--
+ Roughly 4,000 New Yorkers are seriously injured and more than 250 are killed each year in traffic crashes
--
+ On average, vehicles seriously injure or kill a New Yorker every two hours
--
+ There are over 6,000 linear street miles and over 19,000 total lane miles in New York City
--
+ .red[What analytic questions would you want answered to reduce and eliminate traffic fatalities in NYC?]
![img-center-40](images/ideate_discuss_decide_horiz_crop.png)
???
+ At conclusion of exercise or during the break, facilitator attempts to develop key themes out of the ideation along the lines of [bundled ideas](http://www.designkit.org/methods/30)
---
exclude:true
class:center,middle
# Let's Look at Some Data
## [Click to Download Motor Vehicle Collisions<br>1 January 2017 - 1 July 2017](data/20170101_20170701_NYPD_Motor_Vehicle_Collisions.xlsx)
---
class:center,middle
# Wrap-Up
![img-center-100](images/imp_guide.png)
???
+ Facilitator concludes the exercise soliciting questions and introduces the implementation guide at the back of their workbook
---
class:center,middle
# 15 MIN BREAK
---
# Process Mapping
--
+ Allows you to identify and strategize for key steps in your analysis
--
+ Helps sequence tasks and identify gaps in understanding
--
+ Provides a basis for documenting work
---
# Process Mapping
![img-center-100](images/Makebreakfast.gif)
.caption[By Scottsm1991 (Own work) [CC BY-SA 3.0](http://creativecommons.org/licenses/by-sa/3.0), via Wikimedia Commons]
???
+ Facilitator does the "I do" phase by describing a past project that was mapped out
---
# Process Mapping (Our method)
![img-center-50](images/pm_ex.jpg)
---
exclude:true
# Process Map
![img-center-100](images/process_map_ex.jpg)
.caption[Analyzing pet licensing compliance in San Jose. Image Credit: Datapolitan [CC BY-SA 4.0](http://creativecommons.org/licenses/by-sa/4.0/)]
---
# How to Create a Process Map
???
+ Facilitator introduces the method for creating a process map. The language is still in flux. Find the language that is most authentic to your understanding that resonates with the students
--
![img-right-30](images/ideate_discuss_decide_crop.png)
+ Identify the key challenge
--
+ Identify the outcome
--
+ Identify key way to validate the outcome (outputs)<br>
--
_**How do we know we've got it right?**_
---
# Outcomes vs Outputs
--
+ Outcomes are the larger benefits and/or achievements you're trying to realize<br>
--
(happiness, health, well-being, etc.)
--
+ Outputs are the tangible parts of your outcome<br>
--
(survey responses, measured results, etc.)
--
+ Outputs enable us to find outcomes
--
+ Without outcomes, there is no need for outputs
---
name:process-map-comp
# How to Create a Process Map
![img-right-30](images/ideate_discuss_decide_crop.png)
+ Identify the key challenge
+ Identify the outcome
+ Identify key way to validate the outcome (outputs)<br>_**How do we know we've got it right?**_
--
+ Identify the key inputs (data, partners, supplies, finances, etc.)
--
+ Sequence the key questions to turn inputs into outputs
---
# Key Tips
--
![img-right-30](images/ideate_discuss_decide_crop.png)
+ Place each step on a Post-It Note
####
![img-center-30](images/postit_ex.jpg)
---
# Key Tips
![img-right-30](images/ideate_discuss_decide_crop.png)
+ Place each step on a Post-It Note
+ Order and reorder as necessary
--
+ Some steps will need to be broken down
???
+ Facilitator identifies the key parts of the process based on the graphic presented
+ Faciliator reminds participants to follow the process that works best for them
---
exclude:true
# Building the Best Banana Split
![img-right-30](images/banana_split.png)
+ Identify the key result
+ Identify how to validate the result
+ Identify the key inputs
+ Brainstorm key steps to get inputs to the output
![img-center-45](images/ideate_discuss_decide_horiz_crop.png)
####
.caption[Top <a href="http://creativecommons.org/licenses/by-sa/3.0/" title="Creative Commons Attribution-Share Alike 3.0">CC BY-SA 3.0</a>, <a href="https://commons.wikimedia.org/w/index.php?curid=964364">Link</a>; Middle by <a href="https://en.wikipedia.org/wiki/User:2candle" class="extiw" title="wikipedia:User:2candle">2candle</a> at <a href="https://en.wikipedia.org/wiki/" class="extiw" title="wikipedia:">English Wikipedia</a>, <a href="http://creativecommons.org/licenses/by-sa/3.0/" title="Creative Commons Attribution-Share Alike 3.0">CC BY-SA 3.0</a>, <a href="https://commons.wikimedia.org/w/index.php?curid=6698038">Link</a>; Bottom by [Flickr user Kristin Ausk](https://www.flickr.com/photos/kristinausk/), [CC BY-SA 2.0](https://creativecommons.org/licenses/by-sa/2.0/), [Link](https://www.flickr.com/photos/kristinausk/6830445704/in/photolist-bpzPxf-nBDSBJ-nkbey1-awHNhS-nBDRNa-nBrAVL-nDs5g6-nkaBE4-nzCd8L-d5iYiE-TopM85-fFNjqk-nBn71t-nDrddF-7Up5jm-9z2RTb-cWbBBE-cWbwoS-956Wb4-2ghxXH-7KHKAH-fDiXMm-2gmZ7Q-nDra2F-nBn8bz-nkbhjj-nBEWaV-nBnXSc-gmDdW2-7KMHC3-cWbCm9-Sdbg5R-ig33dG-cWbEbq-8PHvG6-cWbqdY-dfNyUw-cWbADG-8dBA6U-8gGzb3-9f99Mr-cWbDoC-51uMvp-cWbuZA-obXW3z-51uMoF-51uNy4-ebAhe-51z23U-51uN5K)]
???
+ A simple "We do" example to get the participants started with the process
+ Participants verbalize with facilitator the steps as necessary to achieve the result
+ Facilitator models the steps where necessary to demonstrate the desired results
---
# Vision Zero
[![img-right-35](images/traffic.jpg)](https://www.flickr.com/photos/danichro/8034920810/)
--
+ Identify the key challenge
--
+ Identify the outcome
--
+ Identify the outcome measures (outputs)
--
+ Identify the key inputs
--
+ Sequence the key questions to turn inputs into outputs
.right-caption[[Image](https://www.flickr.com/photos/danichro/8034920810/) by [Flickr user Damianos Chronakis](https://www.flickr.com/photos/danichro/) [CC BY-SA 2.0](https://creativecommons.org/licenses/by-sa/2.0/)]
![img-center-35](images/ideate_discuss_decide_horiz_crop.png)
???
+ The more involved "We do" phase where we do this together
---
# The Data Analytics Value Chain
![img-center-80](images/valuechain.png)
---
# What did you notice about this process?
???
+ Facilitator reflects on the exercise and what participants are taking away from the experience
--
+ Knowing the problem and sequencing the steps can be harder than working with data
--
+ Having these answers can make the analysis much easier
--
+ This is a process you can do with any challenge, no matter how big
--
+ Everyone has something to add, no matter how technical (or non-technical) they are
---
class:center,middle
# LUNCH
---
name:lunch
class:center, middle
# WELCOME BACK!
---
class:center,middle
# Check-In
## What questions do you have?
---
# Thinking About NYC Parks
[![img-center-80](images/park.jpg)](https://www.flickr.com/photos/p-davidson/4483159007/)
.caption[Image Credit: [Beamish417](https://www.flickr.com/photos/p-davidson/), used under [CC BY-SA 2.0](https://creativecommons.org/licenses/by-sa/2.0/) license]
???
+ Facilitator shifts the frame of thinking to NYC Parks
---
class:center,middle
# Process Mapping with Parks Data
---
exclude:true
# Getting Started
+ Identify the project manager
+ Identify a recorder
???
+ Facilitator has groups identify key roles for working on the projects
+ Data driver should be the least experienced data person in the group to help facilitate learning
---
# Step 1 - Defining the Challenge
???
+ Students employ the same problem definition approach they used with the Vision Zero exercise to focus on an NYC Parks-specific problem
+ The intention is for them to utilize the skills we've modeled
+ Facilitator will seed questions for them to potentially think about
--
![img-right-30](images/ideate_discuss_decide_crop.png)
--
+ Think about a key challenge or issue in NYC Parks, that, if solved, could have an impact
--
+ Find something that interests you
--
+ .green[**Don't let the available data limit your exploration of the challenge**]
---
# Step 2 - Defining the Outcome
--
+ What will come out of your exploration of this problem?
--
+ How does it relate to your question/challenge?
--
+ How will the output benefit decision-makers?
---
# Step 3 - Define the Outputs
--
+ How will we know we've made an impact?
--
+ How will we tell the story of what we've done?
---
# Step 4 - Defining the Inputs
--
+ What sources of information can you use for this problem?
--
+ What other inputs might you need (buy-in, interviews, etc)
--
+ What is the relative priority of these to your project?
---
exclude:true
# Step 1b - Deciding the Type of Analysis
![img-center-60](images/nolytics_type_stacked.png)
+ "I want to know X so I can do Y"
---
exclude:true
# Step 2 - Collection
+ Identify data you'd want to work with
+ What would you expect your problem to look like in data?
---
exclude:true
# Step 3 - Cleaning
+ What do you think you'll have to do with the data?
+ How will you get this done?
---
exclude:true
# Step 4 - Test Hypothesis
+ What are your assumptions about the data?
+ How will you know if these are true?
+ How will you identify things you don't know?
---
exclude:true
# Step 5 - Verify Results
+ How will you check what you have?
+ How will you know if you're right?
---
exclude:true
# Step 6 - Visualize and Communicate
+ How will you tell the story of what you've found?
+ How will they best understand it?
---
exclude:true
+ Take the problem you've developed in your group and come up with a map for how to answer it
+ Acknowledge the knowns and unknowns
+ Develop an overall approach to answering the question
+ Don't be afraid to scale back your question if it seems too big
???
+ Students employ the same process mapping skills we've modeled with Vision Zero to their Parks-specific problem
+ The intention is to get them to think big and then scale back for time and ability with the data
---
class:center,middle
# Wrap-Up
???
+ Facilitator reflects with groups on what they've come up with so far and moves them into the computer lab, if appropriate
---
class:center,middle
# 15 MIN BREAK
---
exclude:true
class:center,middle
# Context for the data we're looking at today
???
+ Facilitator provides review of Parks-specific data
+ Often this is best done by asking if anyone is familiar with the data and having them share the details. Otherwise necessary details are listed on the slides
---
exclude:true
# Service Level Agreements (SLAs)
+ Minimum number of times a property must be serviced weekly
+ A: 5–7 visits
+ B: 3–5 visits
+ C: 1 visit
+ SLA level designated by borough
---
exclude:true
# Daily Tasks
+ Labor booking for routine litter and cleaning
![img-center-90](images/dt_web.png)
---
exclude:true
# Supervisor Inspections
+ All sites should be inspected following Parks Inspection Program guidelines a minimum of once a month
+ Sites receive an Overall rating for all elements
+ Sites also receive a Cleanliness rating
+ Cleanliness rating only addresses the features addressed by the staff entering work in Daily Tasks
---
class:center,middle
# Let's Get Some Data
---
exclude:true
# Definition of Open Data
> Open data is data that can be freely used, shared and built-on by anyone, anywhere, for any purpose
## - [Open Knowledge International](http://blog.okfn.org/2013/10/03/defining-open-data/)
---
exclude:true
# Key Features of Open Data
+ Availability and access
+ Reuse and redistribution
+ Universal participation
---
exclude:true
# Open Data Benefits
+ Transparency
+ Releasing social and commercial value
+ Participation and engagement
---
exclude:true
# Keeping NYC Accountable on Parking Tickets
![img-center-75](images/nyc_parking_ticket.jpg)
.caption[Image Credit: Parking Violator by [Atomische * Tom Giebel](https://www.flickr.com/photos/atomische/2299948817/), [CC BY-NC-ND 2.0 ](https://creativecommons.org/licenses/by-nc-nd/2.0/)]
Source: http://iquantny.tumblr.com/post/87573867759/success-how-nyc-open-data-and-reddit-saved-new
---
exclude:true
# Open Data Concerns
+ Privacy (Personally identifiable information (PII), Mosaic Effect)
+ Confidentiality
+ Security
---
exclude:true
# When Good Data Goes Bad
![img-center-70](images/nyctaxi.jpeg)
.caption[Image Credit: Kenny Louie, [CC BY 2.0](http://creativecommons.org/licenses/by/2.0), via [Wikimedia Commons](https://commons.wikimedia.org/wiki/File%3ANYC_TAXI_(7038011669).jpg)]
[Gawker matches Taxi and Limousine Data with Paparazzi Photos](http://gawker.com/the-public-nyc-taxicab-database-that-accidentally-track-1646724546)
---
class: center,middle
# We're going to be using data from the<br> <a href="https://opendata.cityofnewyork.us/" target="_blank">NYC Open Data Portal</a> for the first part of today's data exercise
## We'll be using [Vehicle Collision Data](data/20160401_0630_NYPD_Motor_Vehicle_Collisions.xlsx)
???
+ Facilitator introduces the NYC Open Data Portal as appropriate for the class and the time available.
---
exclude:true
# Data for Exercise
![img-center-90](images/dt_download_rev.png)
#### [Click to download if you have problems](data/20170601_20170630_DailyTasks_Work_GISPROPNUM_NotNull.csv)
???
+ If necessary, downloading the data straight from the link will help save time.
+ Note the data filters to remove the blank fields that mess up the data. Note this with the participants
---
exclude:true
class:center,middle
# Overview of Excel
---
exclude:true
# Benefits of Excel
+ Easy to use
+ Very visual
+ Lots of features and functions
+ Easy to make charts
+ Does a lot of formatting for you
???
+ Poll class for their key challenges working with Excel
+ Discuss the ways in which Excel is beneficial in daily work
---
exclude:true
# Drawbacks of Excel
+ Not very intuitive
+ Hard to find what you’re looking for (and they keep moving things around)
+ Lots of features and functions
+ Easy to make (bad) charts
+ Does a lot of formatting for you
???
+ Poll class for their key challenges working with Excel
+ Discuss the ways in which Excel is challenging in their daily work
---
exclude:true
# To Err is Human
> Since 1995, 88% of the 113 spreadsheets audited in 7 studies ... have contained errors.
## - Raymond Panko, "[What We Know About Spreadsheet Errors](http://panko.shidler.hawaii.edu/My%20Publications/Whatknow.htm)"
---
exclude:true
# Spreadsheet Errors Have Consequences
![img-center-80](images/athensprotest.jpg)
.caption[Image Credit: [http://underclassrising.net](https://www.flickr.com/photos/0742/4163565280), [CC BY-SA 2.0](https://creativecommons.org/licenses/by-sa/2.0/)]
---
exclude:true
# Finding Context
![img-center-80](images/context.png)
---
exclude:true
# Exploratory Data Analysis
+ Goal -> Discover patterns in the data
+ Understand the context
+ Summarize fields
+ Use graphical representations of the data
+ Explore outliers
####Tukey, J.W. (1977). Exploratory data analysis. Reading, MA: Addison-Wesley
---
exclude:true
# Steps to Prepare This Data
+ Filter for "Work" as the `activity`
+ Summarize by Park
+ Find the Borough for Each Site
+ Visualize Results
???
+ Task: Students follow a guided exercise in key data manipulation tasks using data related to the challenge, including
+ Formatting
+ Sorting/filtering
+ Aggregating (PivotTables)
+ Cleaning/manipulating
+ Visualizing
+ Outcome: Students are practiced in key data analytics task with Parks-specific data following the general outlines of the problem ideation and process mapping work they performed in the morning
+ Format: Guided exercise and facilitated discussion
+ Outline
+ Introduction to key analytical questions
+ Guided task demonstration
+ Wrap-up
---
exclude:true
# Filter the Activity Column
![img-center-80](images/filter1_rev.png)
---
exclude:true
# Filter the Activity Column
![img-right-40](images/filter2_rev_trim.png)
+ Uncheck "Select All"
+ Check "Work"
+ The color of row labels will change to show you're filtering rows
![img-left-50](images/filter3_box_rev.png)
---
class:center,middle
# Looking at this Data
## What Questions Do We Want to Ask?
???
+ Facilitator prompts participants to begin generating ideas to be motivate learning with the dataset
---
# PivotTables
--
+ A data summarization and comparison tool for quickly understanding and displaying the data you’re analyzing
--
+ Find it on the Insert ribbon
--
![img-center-60](images/pivottable1.png)
---
# PivotTables
![img-right-50](images/pivottable2.png)
+ Select the range and destination
--