Entry Name: CSUS-Padgham-MC1

VAST Challenge 2019

Mini-Challenge 1

 

 

Team Members:

Quinlan Padgham (Primary), quinlanpadgham@csus.edu

Ashka Soni, asoni@csus.edu

Hector Rios, hrios@csus.edu

Hung Quach, hungvinhquach@csus.edu

Andy Zhu, azhu@csus.edu

Margaret Davis, margaretdavis@csus.edu

 

Faculty Adviser: Anna Baynes, shaverdian@csus.edu

 

Student Team: Yes

 

Tools Used:

Tableau

Trifacta

 

Approximately how many hours were spent working on this submission in total?

200 hours

 

May we post your submission in the Visual Analytics Benchmark Repository after VAST Challenge 2019 is complete? Yes

 

Video

https://www.youtube.com/watch?v=l8qB_UJNCj8

 

Questions

1 Ð Emergency responders will base their initial response on the earthquake shake map. Use visual analytics to determine how their response should change based on damage reports from citizens on the ground. How would you prioritize neighborhoods for response? Which parts of the city are hardest hit? Limit your response to 1000 words and 10 images.

 

https://lh6.googleusercontent.com/KKNZ9NEMI75rA6txrHf1PUNUP6Ih4uvyAddZfEIb93feXGsX1KT1upWOpFLzj7FHFvoh0xQGQibv9pN8_cjmQAw5uozqZ8vuHEf_mUczT0szkuDXl6eM-yX_Z4aAnPMEsBCGBtob

Figure 1: Medical Damage Over Different Districts

 

Figure 1 shows the sum of all reports towards medical damage at each point for each location in St. Himark. The purpose of this image is to evaluate the raw number of reports and their intensity at a quick glance, which helps gauge where focus should be spent at a high level. Areas with the greatest value have the largest raw sum of reported values, and are likely candidates for areas that have the most severely affected citizens. Adding the values of reports together helps to ignore oddities in reporting moment to moment where averages of reports or other similar metrics can vary wildly. This chart, along with others for the other reported categories, can be viewed here: Link

 

https://lh3.googleusercontent.com/iNzvQrGTejaOqo2c_Mo5kuL59DPkvwvJLLln8EqqdrDPD1rRn0RYrUZ4S3NAESDTWbjfyWSkwkq-zWNyShoaUBNCEoGhNn1F2kE6T2KhoLDL_nbQOxW64-7X8X2LMWoZ9WPGb1eR

Figure 2: Average Damages seen in Medical, Roads, and Shake Intensity over all Districts

 

Figure 2 guides responders to critical data aspects that may be important for them to react quickly to the situation. The chart gives vital information such as the damages to roads/bridges so that they may navigate better to those most in need. Medical is also highlighted in this table to see the medical centers and/or medical needs of people. Lastly shake intensity is included here to give responders a quick view of how each district reported its shake intensity. Link

 

https://lh6.googleusercontent.com/kgGWZJih_mLC-uQ5Kr9B10GKDYCFvnt04IOp4oNXNuEPZzAczRB8olam2wKlIHfFyefg_0HGskGUWA40etqXax7FV244LyvwoRDKIkgNKC7aHJoMXnfOo-O0zZNoaWAYzVfY48_Y

Figure 3: Reporting Quantities in Different Districts

 

Figure 3 displays each reporting element to the responders so they can be selective in what they want to focus on. From the averages of reporting of Buildings to Sewer/Water, the averages are displayed here. This table is meant to be adaptable to what the responders may want to see in the data, for example, certain events such as the relation of the reported power damage to the damage to buildings can be identified here. Link

 

https://lh5.googleusercontent.com/f8GUyuyowMZluJNdApiUWqKqAz_32YtRdKnQgDOM28q1MQJIWGz9tvbt-6Yhj1Sp1sJ-v5OB5W4RMVCSVg0ict2yGIefwwWPs1QIJ2_-h-cNy8llZ4MCLAmV39avtG5i5vkC8j37

Figure 4: The most impacted areas in the region.

 

Figure 4 shows the responders the level of severity reported in the overall reporting over time in each district. This helps guide responders to see the amount of reporting done for each district over time. The area seen in the table represents the proportion for each district. Overall, it is meant for responders to have a timeline which they can follow which highlights the reporting amounts for each district. Link 

 

https://lh6.googleusercontent.com/5cNS6ugzx42mcMXDjGZdWdvbEYJGQ5LFnzwEd3RKnfIgGIWTISzs90pKMmXTFfw-HTtEQlXlxyoxPZNtz4GKZAXXqQqWvGH9xrHH8XJwMLhyVosRGikdZzd3Bd87_8n1r_1dfVAW

Figure 5: City Influence in Reporting

 

Figure 5 shows responders the difference in reporting for districts that are affected by city projects that may affect reporting values. The blue area shows the sum of reports made that were not affected by city projects, while the gray area are the amounts of reporting in districts with city projects in them. This table helps responders see the increase in reporting in areas that may have been affected by city projects. Link

https://lh3.googleusercontent.com/GsNhkTS4UftJzZKWnbHVnsVusAey5ghpPsfbzA8VyNEchdwX3AH6K81RQqfyYnZ9sDB5z_CNdpJxCeqjN6dgVz-2uH-8NcOiNe5Y-2_JXuoveAhyV_tFwEk113H3cyKo6R_88s76 


https://lh6.googleusercontent.com/nLk-wO6cUKnAY7iHcSPJLvvBryaV8UJfMqRNv9IuXHfbm7dIYhmOg5IBZTlmfwJNTzDZUY6cZRwj7mnOHhJBbTeWmIJZvEO6AUC6KzdspHS3NVMTdi-Q5Fr8PF3RcihHrXvdugvP

 

Figure 6: Shake Intensity versus Building and Medical Report Quantities among Different Locations during Earthquake

 

In summary, he emergency responders can prioritize their neighborhoods based on the number of reports each location received. From the graphs above, we can see that location 3 is highly affected along with the day and time. The affected areas can be pointed out with the shake intensity that is high.  Link

 

 

2 Ð Use visual analytics to show uncertainty in the data. Compare the reliability of neighborhood reports. Which neighborhoods are providing reliable reports? Provide a rationale for your response. Limit your response to 1000 words and 10 images.

 

https://lh5.googleusercontent.com/vNUnimbrShf1mWYqMD-dvTx5NpqQGO60n50RKjY4jg3bHPZG5oO4qXtOC_IetM3qPrnL5bMqCkC6NwMlbmoswUM4lt5Av_N0SJ0eFL1rBenHE4-7RRQk4xD_fcjHfAbLbal4wulB

Figure 7: Map of Relative Volume of Reports

 

The map above shows the relative volumes of reports (the size of the squares for each area) and the average value reported (color and number below) for shake intensity. This helps give an idea of how much each group is reporting, and can be viewed in relation to the perceived severity. One location of note is region 8, along the southeast shore. They have a relatively high volume of reports given the relatively low value for shake intensity. https://lh6.googleusercontent.com/jsjlbX9hQcwXq85ScvFCThi1YhHI8vMCP8LVXpBlI-SdcQokyuTg37Ak4ddgx2FdaK9rq0sJjqwCvh4YUrjXxPaYrjETqyx7mtGTyuyNZ66dX5hGGqP5pACZaDr5IkBLpNKBRZ-3

Region 8 also had particular pattern of reporting. Whereas most regions at the beginning of a quake would have a sudden burst of reports that tapered off over time, region 8 instead often would start more slowly and build in intensity. This could suggest that region 8 had a spread of knowledge about the quake or potential damage and then would report it based on hearsay, rather than reporting first hand information.

 

https://lh6.googleusercontent.com/B17wHwMT1p_2m-9AwBDKXvnkXnx6lLI-INwMmeAa6gC_lk1zc2xchXl2Ip3tQsvwdpPLWNvFkLeh86A-Di-cdJvUesE3zixtkmMX50pPRhzkYUStWpwV92kxPuDO6FYunEtL_zod

https://lh6.googleusercontent.com/hHgQ-WQeBzQvtL-Kc46a9PfBC77RAEXCuTRvbIXWa0DcShz6NhOwDrVYvfD1ydzWby4Vb6XcdDd2uOouuqU04y4YEiQ6matB1_0ggO0j0EYYWZF4jisY4LPrKijkaXGeWtFFPRW7

 

The first image shows the number of reports by individual hours of the neighboring locations in the northern part of St. Himark that should be affected by the earthquake, based to the shakemap. The locations tend to follow the same reporting pattern throughout the day but most of the reporting from Location 3 is done in short intervals of 1-2 hours through all three days. The second image shows a positive correlation between shake intensity and damage among these locations. Location 3 stands out amongst the others by consistently having the greatest shake intensity and damage reported. It is also the only location which reports high shake intensity and damage across all days. The reason can be attributed to Location 3 being the historic center of the city with structures made of decorative brickwork which can be vulnerable to earthquakes. Location 3 also had ongoing utility projects, prior to the earthquake, in Power, Water and Sewer, and Roads and Bridges that could have affected the reporting. From the two images above, these neighboring locations seem to be providing reliable reports.

https://lh5.googleusercontent.com/xFRDE176OdOqUt_UarZuxpoaljdhulQ2Q5s3GT6ZVE5YdzGA_Lu4jjAWchd1ZdTzOgVaI27auHjUSrx8LMJhEkmXfWkI3cF2cDi6A6QkTkoFSMztvRbcV4nTet6UcZ2I3Qp-eOW4

The image above shows the number of reports by individual hour of neighboring locations in the southern part of St. Himark. Despite being the two furthest locations from the earthquake, Location 8 and 9 have the most amount of reports. Similarly, to Location 3, Locations 8, 9 and 10 also have a sudden increase in the amount of reports on a given hour that is not reported on the same hour by any other of the nearby locations. A possible explanation of the inconsistent reporting from location 10 would be attributed to the locations rural lifestyle where damages are less likely to be noticed immediately. However, based on the shakemap, Location 3 was directly affected by the earthquake while Location 8 and 9 was not. Based on the shakemap and the inconsistency of the reporting, it is doubtful whether the reports from Location 8 and 9 are reliable.

 https://lh6.googleusercontent.com/3anF3PRrhdwUIt8CZ_X1rRyF0SzxiyHKidzubN9Hq9-tPvbJrdJjC2ZN2J-r6sER5X2q3udQvFMncd_Ito7KGx8fKhWSkaBGaFQkCQz-pUQk0zfvnnh9JRja3c8uA6EV9l6k7-19

The image above shows there is a negative correlation between shake intensity and damage amongst the reports from the neighboring locations in the southern part of St. Himark. Locations 12 and 13 are the closest locations to the source of the earthquake and reported an average shake intensity between 3 and 5. Despite having lower shake intensity, Locations 8, 9, 10 and 11 have high reported damage across all categories and days than Locations 12 and 13. Based off the shakemap and the above image, there exists doubt in whether the reports from Locations 8, 9, 10 and 11 accurately represent the damage caused by the earthquake.

 

3 Ð How do conditions change over time? How does uncertainty in change over time? Describe the key changes you see. Limit your response to 500 words and 8 images.

 

https://lh6.googleusercontent.com/3FBsetWGuLlS6aCP-QvK7ShWqEAgXJn9Oyz3JcsB9TpKQPplaNInClZs9dp7QbZ2lQgKyKoq-45Muw8PthU6r7hNpTyBvtX-KR871wKZF7EvkLzBBBS-ZTG7DiRs_REfIAxQwBs_

 

The chart above shows the standard deviation of reports during different intervals of time throughout the event. The grouped times of reports labeled at the bottom are: Pre quake 1, quake 1, pre quake 2, quake 2, post quake 2, quake 3, and post quake 3. Quakes are considered the one hour interval from the first major grouping of reports for each quake, and post quake periods are the reports from that hour mark on until the next quake. During any quake, the reporting is extremely consistent compared to periods of time outside quakes, but on the whole consistency in reports between individuals rose as they experienced more quakes. This suggests that reports given quickly are, while perhaps inflated, more reliable than reports that happen a significant amount of time afterwards.

 

https://lh6.googleusercontent.com/HmjLC_mBwSjEPgF3afrBze7G5ZIBsuVO0C2HU0OTtQljNKPSMqZLVRjD81VyGEFNE0BVmfIZKLnz8OLK2kIPQzjw9Zb2slsmk0C8kPimoj_SooiAk3R4XH3g4VUeK4GSs3lR3p6y

 

This picture represents the median and average of how data has changed over time and the pattern of all data. The median and the average shares quitely the same fluctuation. However, there are two critical values showing uncertain data in the chart which are the location and medical. I think it could be an uncertain data in the excel. Also, the missing data on shaking intensity is cleaned by Trifacta, so the median data shows an accurate pattern when we compare with other variables. 

 

4 ÐÐ The data for this challenge can be analyzed either as a static collection or as a dynamic stream of data, as it would occur in a real emergency. Describe how you analyzed the data - as a static collection or a stream. How do you think this choice affected your analysis? Limit your response to 200 words and 3 images.

 

Since the responders are using the shake intensity for immediate decisions on how to deploy aid, we decided to provide a cumulative data analysis by treating the data as a static collection.  The benefits of this approach are that we have more data to verify the trends we discover.  Our goal for future work is to be able to provide the same level of insights but by treating the data as a stream of dynamic information.