This project analyzes species observations across parks, revealing trends and insights to aid conservation efforts and enhance our understanding of biodiversity.
- Introduction
- Datasets
- Data Cleaning
- Exploratory Data Analysis
- Visualizations
- Conclusions
- Future Work
- Installation
- Usage
- Contributing
- License
- Acknowledgements
This project aims to explore biodiversity data from various national parks, uncovering patterns and trends in species observations. The analysis focuses on understanding the distribution of species, their conservation status, and the overall biodiversity in these parks.
The project utilizes two main datasets:
- Observations Dataset: Contains information about species observations in different parks.
- Species Info Dataset: Provides details about species, including their category, common names, and conservation status.
- Merged datasets based on
scientific_name. - Handled missing values, particularly in the
conservation_statuscolumn. - Ensured consistency in species names and park names.
The analysis includes:
- Total number of observations per species.
- Number of observations per park.
- Distribution of observations by species category.
- Analysis of species by conservation status.
Several visualizations were created to aid the analysis:
- Bar chart of the number of observations per park.
- Bar chart of the distribution of species categories.
- Bar chart of species counts by conservation status.
- Bar chart of observations by conservation status.
The analysis provided valuable insights into the biodiversity across different parks, highlighting key trends and areas needing conservation focus.
- Perform temporal analysis to identify seasonal patterns.
- Conduct a detailed analysis of specific species with high or low observations.
- Examine park-specific conservation efforts.