Exploring the Accessibility of COVID-19 Referral Hospitals in Metro Manila
This is a combination of my final project for Network Science with Sasha Garcia and Introduction to Geospatial Analysis with Vee Rivera.
Executive Summary
The COVID-19 pandemic emphasized the Philippines’ vulnerable healthcare system - especially for the COVID-19 referral hospitals. With the government’s commitment to democratize healthcare in the country, they need to either increase the capacity or build more healthcare facilities. In a 2015 article, the expense of putting up a hospital is estimated to cost around $9M. Taking into account inflation, the cost of building one today will cost even more. Hence, building more healthcare facilities without analyzing the current service area of existing facilities will be costly.
As such, we aim to assess the accessibility of the COVID-19 referral hospitals. We approach this problem by performing a service area analysis of these facilities in Metro Manila, building a bipartite network, performing simulation to identify the capacity of each, and solving a facility location problem to propose optimal location for new facilities.
As a result, we were able to identify unserved barangays which are not within a certain walking distance from a referral hospital. We were also able to calculate for node redundancy using the bipartite network and able to identify less redundant hospitals that may need additional support. Additionaly, our simulation provided metrics that would assist in assessing the needs of these facilities. Lastly, we are able to determine the optimal sites to establish new healthcare facilities to increase the coverage in Metro Manila.
Figure 1. Coverage Map of Hospitals (COVID-19 referral hospitals) in Metro Manila.
Isochrone maps are generated per facility per trip time (5-minute, 15-minute, 30-minute, 45-minute, 60-minute)
Data
1. Global Administrative Area Database (GADM) The Global Administrative Area Database (GADM) contains spatial data for countries in varying levels of detail. For this study, we focused on the barangay level, the lowest level of administrative division in the Philippines.
2. Philippine Census The Philippine Census contains data of the population on a barangay level. The most recent available data (2015 Census Data) was used in this study.
3. DOH COVID-19 Tracker The DOH COVID-19 Tracker is an open data initiative by the Department of Health as part of their efforts to fight against the pandemic as well as to promote public health. It contains COVID-19 related data such as number of new cases, recoveries, and deaths. The granularity of data is on a barangay level. In addition to this, it includes data on the COVID-19 referral hospitals which was used for this project.
Methodology
In this section, we will be discussing the methodology pipeline implemented in this project. A high-level overview is outlined as follows:
Figure 2. Methodology Pipeline.
Five main steps were done in order to address the problem statement and obtain the results.
Data Processing
Using the data discussed in the Data Section, the data was processed and prepared for analysis. In this subsection, we will be discussing how the extracted data were processed before performing the service area analysis, network analysis, simulation, and solving facility location problem.
Barangays in Metro Manila
With the data obtained from the Philippine Census, we have identified $1,691$ barangays in Metro Manila with a total population of $~13$ million.
Figure 3. Barangays in Metro Manila.
The barangays in Metro Manila were plotted using extracted polygons from the GADM dataset.
Additionally, since we want to simulate a demand point, we assumed that most of the population of a barangay is found in its center.
Figure 4. Barangays Centroids in Metro Manila.
The barangays centroids in Metro Manila were plotted using extracted polygons from the GADM dataset.
Road Network of Metro Manila
Performing the service area analysis would require the road network of Metro Manila. To extract this, we used the Python package osmnx
which allows users to extract geospatial data from OpenStreetMap.
Figure 5. Road Network of Metro Manila.
Approximately $117,000$ intersections were used as nodes and $319,000$ streets were used as edges to define the network.
Building Footprints of Makati City
Aside from the barangay centers as a demand point, we also want to have a more granular level of detail. Hence, we assumed that a building can also be a demand point. As such, we extracted building footprints. Since extracting building footprints for the entire Metro Manila will require a huge computing power, we limited our search to the city of Makati as a proof-of-concept. In extracting the building footprints in Makati City, we also used the Python package osmnx
.
Figure 6. Extracted polygon of building footprints in Makati City.
Approximately $55,295$ building footprints were used to simulate a demand pont.
To simply our analysis, we extracted each building’s centroid.
Figure 7. Building footprints in Makati City with a precision = $3$. The precision of decimal degrees allow us to unambiguously recognize neighborhood and streets, retaining important information for the analysis.
Referral Hospitals in Metro Manila
Using the data from the DOH COVID-19 tracker, we were able to scrape the locations of the referral hospitals using Google Places API. There are a total of 159 COVID-19 referral hospitals in Metro Manila with 874 ICU beds, 4,795 isolation beds, and 3,772 ward beds.
Figure 8. COVID-19 referral hospitals in Metro Manila.
The location of the facilities in Metro Manila were identified and plotted on top of the the extracted polygons from the GADM dataset.
Service Area Analysis
Using the road network, we are able to visualize how accessible the referral hospitals are. Here, we used the walk network type as this is the most accessible to the general population with the standard speed of walking set at $4$km/hr at different walking distances. This enables us to see the differences in accessibility given increasing walking duration. Assumptions that were used in order to perform the service area analysis are summarized in Table 1.
Table 1. Service Area Analysis Assumptions
Parameter | Assumption |
---|---|
Network Type | Walk |
Walking Speed | $4$km/hr |
Trip Time | $5$-, $15$-, $30$-, $45$-, $60$-min |
Once the service areas are generated, we can identify barangays that are not accessible per service area. To perform this, isochrone maps are generated to show areas accessible from a point within the trip times identified in Table 1. To generate isochrone maps, a combination of Network Science and Geospatial Analysis techniques were implemented. Using the extracted road network of Metro Manila, we implemented the algorithm below, which is based on a publicly available example of the usage of OSMNx.
A total of $795$ isochrone maps were generated per facility per threshold. Figure 9 shows the results for each trip time. Intuitively, a longer trip time will translate to a larger service area.
Figure 9. Coverage Map of COVID-19 Referral Hospitals in Metro Manila.
Isochrone maps are generated per hospital per trip time (From L to R: $5$-minute, $15$-minute, $30$-minute, $45$-minute, $60$-minute).
Hospital-Barangay Network
To create a hospital-barangay bipartite network, we assign the hospitals as the first set of nodes and the barangays as the second set of nodes. Nodes of the hospitals connect directly to the nodes of the barangays and there are no direct hospital-to-hospital or barangay-to-barangay connection. Figure 10 illustrates this network.
Figure 10. Hospital-Barangay Bipartite Network.
A bipartite network was used to represent the relationship between hospitals and barangays.
Here, a connection between a hopsital and barangay is established if the barangay is within $n$-minutes away from the hospital, with $n$ representing the different trip times. By using the results from the Service Area Analysis, we also generated $5$ bipartite networks based on varying trip times.
Figure 11. Hospital-Barangay Bipartite Network.
Connections between the hospitals and barangays gradually become more evident as trip times increase.
Capacity Simulation
Once the bipartite network has been established, we are able to use the network to simulate a COVID-19 scenario wherein patients from a barangay will seek treatment from hospitals within its network as defined by the bipartite network. As such, we can follow the progress of the capacity of the hospitals and the number of unserved cases through time. In the simulation, we use the $30$-minute bipartite network established in the Hospital-Barangay Network subsection and assume a random assignment of hospitals for each patient of a barangay within its hospital-barangay bipartite network. The assumptions are summarized in Table 2:
Table 2. Service Area Analysis Assumptions
Parameter | Assumption |
---|---|
% of Susceptible Individuals in a Barangay | $10%$ |
Probability that an Individual will Contract COVID-19 | $1%$ |
Probability that an Infected Person will need to be Hospitalized | $1.5$ |
Hospital Capacity | # of ICU beds + isolation beds + ward beds |
No. of Days in Hospital | $14$ days |
Days of Simulation | $30$ days |
In the simulation, we assign the size of the node to represent the hospital’s current capacity. Throughout the simulation, the nodes will increase its size to indicate that there have been an increasing number of patients coming into that hospital while the opposite will mean a decrease in patients in the hospital. Aside from this, a green node means that the hospital is still within capacity, while a red node means that the hospital is already at full capacity.
Facility Location Problem
The Facility Location Problem is concerned about the specific question: “How do we place our hospitals, clinics and other healthcare facilities such that the maximum number of people are within its coverage?” Such a problem fall under the Maximum Coverage Location Problem (MCLP). Which, under the assumption that the hospital sites are coordinated ahead of time, solves the optimal sites to place healthcare facilities to cover the greatest number of unserved population.
While there are many ways to solve an MCLP, including complex methods such as kernel density estimation (KDE) or
least significant difference (LSD), the simplest way is to solve it randomly. This is precisely the method used by Can Yang in their MCLP module mclp.py
found in their Github repository.
The algorithm is indeed simple, and loosely operates as follows: Given parameters:
- $M$ - Number of candidate sites
- $R$ - Radius of coverage of sites
- $K$ - Number of sites
Loosely execute as follows:
- Generate $M$ number of candidate sites in random locations in the vicinity of the concerned population
- Examine all permutations of $K$ number of sites and check for their total coverage with radius $R$
- Select the permutation with the greatest coverage of the concerned population
We selected $M = 1000$ candidate sites, which is more than enough for the concerned population. With this, the optimal sites do not make any significant changes, and consistently cover the same population between repeat runs.
Additionally, since we are essentially solving a resource allocation problem, we are also somewhat bound as to the number of sites, $K$, that we use. Ideally, we want to find the maximum coverage with minimum number of sites used.
Discussion of Results
In this section, we present the results of the service area analysis, hospital-barangay bipartite network, and the network simulation.
Service Area Analysis
As discussed in the Road Network of Metro Manila subsection, the isochrone maps show the areas serviced by the referral hospitals. Through this, we have identified the barangays which are not within a service area of the hospitals.
Figure 12. Unserved Barangays.
Each trip time shows unservered barangays in Metro Manila.
The results of the service area analysis are quantified in Table 3. Intuitively, the areas covered by referral hospitals increase as the trip time increase, it follows that the number of unserved barangays gradually decrease. Results also show that the unserved population decrease as the trip time increase.
Table 3. Service Area Analysis Results
Walking Duration | $5$ mins | $15$ mins | $30$ mins | $45$ mins | $60$ mins |
---|---|---|---|---|---|
Unserved Barangays | $1,228$ | $453$ | $37$ | $6$ | $3$ |
Unserved Population | $6.7$M | $2.6$M | $238$K | $25$K | $3$K |
Hospital-Barangay-Network
Intuitively, as the trip time increases, the number of connected hospitals and barangays increase as well. In addition to this, an important aspect of the hospital-barangay network is that we are able to calculate, on the average, the number of hospitals accessible per barangay and the number of barangays in the service area of a hospital. The results for all trip times are quantified in Table 4.
Table 4. Service Area Analysis Results
Walking Duration | $5$ mins | $15$ mins | $30$ mins | $45$ mins | $60$ mins |
---|---|---|---|---|---|
Average # of Hospitals Accessible per Barangay | $1$ | $2$ | $5$ | $10$ | $16$ |
Average # of Baranggays in the Service Area of a Hospital | $4$ | $17$ | $55$ | $108$ | $171$ |
Another important aspect of the hospital-barangay bipartite network is the ability to calculate for node redundancies. In essence, nodes with high redundancy have less impact to the overall network in the event that it is removed from the network. On the other hand, nodes that have a low redundancy bring more impact to the network if it is removed. Focusing on the $30$-minute bipartite network, we see that there are multiple nodes that have a low node redundancy. It is important to identify these as a starting point for resource allocation.
Figure 13. Node Redundancy.
The identification of less redundant nodes help highlight which nodes have high impact to the network when removed.
Capacity Simulation
Simulating the capacity of the referral hospitals given the assumptions stated in Table 4 has allowed us to capture metrics such as number of unserved cases per day and the number of hospitals at full capacity. Aside from this, we can also identify which hospitals have the most number of days at full capacity. We believe that these metrics will help asses the need to provide extra healthcare facilities in order to avoid overloading the hospitals, thus being able to accommodate all COVID-19 patients at any given time.
Figure 14. 30-Day Capacity Simulation of the Hospitals.
The simulation shows which hospitals are within capacity (green nodes) and which reached full capacity (red nodes).
Based on the simulation, the number of unserved cases have an increasing trend since Day 1. However, Figure 15 shows that there are also evident peaks that reflect surges in COVID-19 cases. Through this simulation, hospitals will be able to anticipate peaks or surge of cases so that they can better prepare and manage resources.
Figure 15. Number of Unserved Cases per Day.
Using the number of unserved cases per day as a metric will help anticipate peaks or surges to better manage resources.
Determining the number of hospitals at full capacity per day is a great indicator of how much healthcare facilities are lacking in NCR. Through this metric, we will be able to assess the need to provide healthcare facilities in order to ensure that hospitals will not reach their full capacity and close themselves off to potential patients. Results of this metric are illustrated in Figure 16.
Figure 16. Number of Hospitals at Full Capacity.
Using the number of hospitals at full capacity as a metric will help assess the need to provide extra healthcare facilities.
Lastly, it is also important that we are able to identify which hospitals are at full capacity and the corresponding days they remained at that status. Figure 17 shows the top $20$ hospitals that have the most number of days at full capacity. This can help in assessing the need to increase the hospitals capacity or to add other healthcare facilities within the area of these hospitals.
Figure 17. Number of Days at Full Capacity per Hospital.
Some hospitals are seen unable to recover given a certain number of days of being at full capacity.
Facility Location Problem
For the Facility Location Problem, we focused on the results of the $30$-minute threshold. Figure 18 shows the unserved location of barangays and building footprints. These are the demand points identified wherein we will try to recommend optimal location for new healthcare facilities which will be discussed in the succeeding section.
Figure 18. Unservered Areas Used for Facility Location Problem.
L:Unservered barangays in Metro Manila and R: Unserved building footrpints in Makati City.
Demand Point: Barangay
Shown in Figure 19 are the covered barangays vs hospital sites generated by the MCLP algorithm. The graph shows diminishing returns. By the 2nd hospital placed, the number of covered barangays is already ~$2/3$ of the total number of unserved barangays. Any hospitals placed beyond the second show significantly reduced coverage, with $8$ additional facilities completely covering all the unserved barangays.
Figure 19. Covered Barangays vs Hospital sites in the MCLP.
The coverage starts to diminish at $2$ sites.
The result of the MCLP algorithm for the first $4$ optimal sites is shown in Figure 20.
Figure 20. Optimal Sites from MCLP using Barangay as Demand.
Optimal sites by solving MCLP in GREEN, unserved barangays are in RED, and existing referral hospitals in PURPLE.
Demand Point: Building Footprints
A similar result is shown in the scale of Makati. Shown in Figure 21 is the coverage for $1$ to $4$ sites. This time, the population is much denser, so there is some inevitable overlap between sites. Notice how the initially placed sites shift when another is placed to avoid overlap. This is an inherent result of the algorithm as any overlap takes away maximization of the covered area since redundancy is not considered in the algorithm.
Figure 21. Optimal Sites from MCLP using Building Footprints as Demand.
Optimal sites by solving MCLP in GREEN, unserved barangays are in RED, and existing referral hospitals in PURPLE.
Now, shown in Figure 22 are the covered buildings vs hospital sites generated by the MCLP algorithm. Similar to the NCR scale, the graph shows diminishing returns, albeit at a lesser rate. By the 4th facility, the coverage is already $ ~ 4/5 $ of the total number of buildings. The lesser rate of diminishing returns is a result of the more densely packed population, as well as there being a greater number of demand points to consider. Even still, by the 9th facility placed, all the demand points are covered, save for a few edge cases.
Figure 22. Covered Buildings vs Hospital sites in the MCLP.
The coverage starts to diminish at $4$ sites.
Conclusion
With the continuous increase in COVID-19 cases in the Philippines, especially in the National Capital Region, it is important to bring focus to the facilities that help treat those COVID-19 infected individuals. By identifying referral hospital and their service areas, we were able to identify barangays that are not within n-minutes from COVID-19 referral hospitals. This allows us to further highlight the needs of the communities who are extremely challenged in our current situation.
Aside from this, using a hospital-barangay bipartite network, we were able to calculate node redundancies and highlight less redundant hospitals that may need additional support, be it additional resources or additional healthcare facilities in their area. By identifying these hospitals, we may be able to create short-term and long-term solutions that will further aid in addressing not only the current pandemic, but also future similar situations.
Using our simulation, we were able to capture metrics that can help our government make policies in terms of allocating resources. As we highlight hospitals that are mostly at full capacity, we believe that these can serve as triggers to assess the capability of hospitals to accommodate the communities around them and open up the opportunity to bring even more resources to communities and barangays that need them.
Lastly, we explored the uncovered areas and determined the optimal sites for new facilities. This was done through a Maximum Coverage Location Problem (MCLP) on two scales: the NCR-scale with barangay centroids as the demand points and Makati-scale with building centroids as the demand points. On both scales, the coverage vs number of sites showed diminishing returns, with the NCR scale thresholding by the 2nd site, and Makati scale thresholding by the 4th site.