Vietnam War Casualties

Preface

Data Collection Methodology: The dataset is not publicly available on the internet. However, the website The Virtual Wall contains the data of all the Vietnam War Veterans that (1) Killed in Action, (2) Died of Wounds, or (3) Missing in Action. In order to obtain the entire dataset, I created a script to visit each page of the website and record the data as an observation. Due to some discrepancy in the different system, I was able to obtain 58,275 records of the Vietnam War Veterans that passed away. The dataset is found here. Among all of these observations, there are 52,583 observations that have a specific death province.

Data Cleaning and Processing: The original Jupyter Notebook can be found here. The dataset used in this visualization is found here.

Overview of U.S. Casualties

Casualty Distribution by Region: The choropleth displays patterns that result in effective comparisons between the different provinces. The main drawback of a choropleth is its inability to account for the difference in the area for each of the provinces. As a result, larger areas may appear more important, even if the data value is not proportionally larger. In order to account for regions with larger area, I rescaled the data by dividing the total death by the region area. As a result, the choropleth shows the death per area.

Casualty Distribution by Year: To display the progression of the death toll through the years, I created a plot to break down the total death for every province for each of the year during the war.


Observation

Total Death: The choropleth shows an increasing trend in the total death as the province gets closer to the border with North Vietnam and the DMZ Line. Notably, the death count is higher for regions near the border with Cambodida. Although we cannot make any conslusive statement about the visual pattern that we observe. We can suspect that this increase in U.S. Military death count can be attributed to the Ho Chi Minh trail leading to Saigon (Ho Chi Minh City). Even rescale the death to density, the general pattern holds while further emphasizes on the number of death per area in the provinces near North Vietnam.

Average Age: Visually, the choropleth of average age of death almost has the exact opposite trend as the total death choropleth. The provinces further away from the border with North Vietnam has a higher average age at death. This might mean that the provinces with higher average age at death have more administrative roles in the war and less combat action.

Average Service Length: Similar to the choropleth of average age at loss, the choropleth of average service length tends to be highest in the provinces further away from the border with North Vietnam and Cambodia. The visual similarity between the choropleth of average age and service length suggests that the soldiers that stationed in low-combat area tend to be older in age.

Total Death per Year: The choropleth over time shows the escalation in the U.S. military involvement in the Vietnam War up until the 1968. As quickly as it rises, the U.S. military total death also quickly decreases over the span of a few years.

Note

Many of the reported provinces are not in existence and replaced with a different name. Some other provinces are merged together. For a full list of provinces during the war, click here. I manually map the provinces to its current location based on the capital city.

All casualties occured in North Vietnam are reported as "North Vietnam" without any specific province, preventing me from mapping them to specific location on the map.

Citation

National Archives and Records Administration. (n.d.). Vietnam War U.S. military fatal casualty statistics. National Archives and Records Administration. https://www.archives.gov/research/military/vietnam-war/casualty-statistics

Shribman, David M. “David M. Shribman: A Tale of Two Wars.” Gazette, Pittsburgh Post-Gazette, 21 May 2020, www.post-gazette.com/opinion/david-shribman/2020/05/24/COVID-19-Vietnam-War-America-resiliency-body-count/stories/202005240034.