This project is a machine learning-based system that analyzes network traffic and identifies whether a connection is using a VPN or not. It is designed to help in understanding traffic behavior, enhancing network security, and detecting anonymity-based usage in monitored networks.
The system works by analyzing various features of network packetsβlike protocol type, packet lengths, and IP behaviorβto classify them into VPN or Non-VPN traffic. It includes data preprocessing, feature engineering, model training using Random Forest, and evaluation with accuracy metrics and visualizations.
network-traffic-analyzer-vpn/ β βββ src/ β βββ network_traffic_analyzer_vpn.ipynb # Main Jupyter notebook β βββ data/ # Dataset files (add here) βββ requirements.txt # Required Python packages βββ .gitignore # Files/folders to ignore βββ README.md # Project documentation
- π Analyzes real-world network traffic data
- π§ Detects VPN usage using Random Forest Classifier
- π Data preprocessing and visualization using Pandas, Seaborn, Matplotlib
- β Displays accuracy and classification results with confusion matrix
- Language: Python
- Notebook: Jupyter Notebook (
.ipynb) - Libraries:
pandas,numpyβ Data handlingmatplotlib,seabornβ Visualizationscikit-learnβ ML modeling and evaluation
- Clone the repository:
git clone https://github.com/your-username/network-traffic-analyzer-vpn.git cd network-traffic-analyzer-vpn