A Scientific Way to Gauge Risk
When Gyan Prayaga 鈥21 and Jeev Prayaga 鈥22 learned about a data science competition designed to help their hometown, Los Angeles, reopen safely during the coronavirus pandemic, they eagerly signed up.
鈥淲e got to do something we thought would be helpful and impactful,鈥 says Gyan, a physics major. It was 鈥渁 good cause.鈥
Along with their parents, Rena Brar Prayaga and Ram Prayaga, the brothers formed a team called the Padron Peppers. 鈥淚t was kind of a family affair,鈥 Gyan says. 鈥淎 team of different people with different skills came together. It was really fun.鈥
The City of Los Angeles and several area companies hosted the 2020 COVID-19 Computational Challenge. Participating teams had access to open data resources such as geospatial data, COVID-19 data, and mobility data. They could also take free training on epidemiology, spatial analytics, and data science.
鈥淒ata science was kind of a new thing that we learned on the way,鈥 says Jeev, a computer science major. While their parents worked on the model, Jeev and Gyan developed the app. They had 2 weeks, from May 25 to June 8, 2020, to submit their solution.
鈥淥ur goal was to compute a risk score of getting COVID based on where you live and the activity you want to do,鈥 Jeev says. 鈥淭he idea was a user could go to this web site and say they wanted to go for a bike ride at a specific park at a specific location, and then we鈥檇 use a bunch of datasets to compute the relative risk of getting COVID.鈥
The Prayagas figured out the different risk factors they needed to consider to compute a risk score. One of the data sources they used was the Community Need Index, which showed various health factors in a given neighborhood. They also used the LA Times database for the number of COVID cases and deaths. Another database provided common activities and levels of risk. The whole team developed 2 sets of risk calculations鈥攐ne at the model stage and one in the application.
They took a compound approach to gauging risk. They considered the risk of the location, the transportation chosen for reaching that location, and the planned activity.
鈥淔or example, if you were to run or walk around your neighborhood, the risk is much less than playing soccer, because that鈥檚 a contact sport,鈥 Gyan says. But if you take a 20-minute bus ride to the park to play soccer, every minute on the bus also increases your risk of exposure.
They included additional risk factors in their calculations as well, such as the user鈥檚 age, because older people are at higher risk, and whether the user has any chronic, pre-existing conditions like heart disease or diabetes.
All that data went into the calculations. They computed a numeric score, which was translated into a qualitative scale: 鈥渧ery low,鈥 鈥渓ow,鈥 鈥渕edium,鈥 鈥渉igh,鈥 or 鈥渧ery high鈥 risk.
They also provided tips to help make the user鈥檚 chosen activity safer 鈥 like wearing a mask while riding a bus and using hand sanitizer after getting off.
The Padron Peppers鈥 solution was named 鈥渂est application鈥 and they have been invited to speak at a virtual conference called IM Data in November.