# FS-TFP This is the offical repository of **FedDGCN**: A Scalable Federated Learning Framework for Traffic Flow Prediction. ![overview](./figures/overview.jpg) It is also a the traffic flow prediction extension based on [FederatedScope](https://github.com/alibaba/FederatedScope). NOTE: This is an early version of **FedDGCN**. The full version will be updated after testing is completed. --- # 1. Environment We run the experiment on a **Linux system**, i.e **Ubuntu 22.04**. It has not been tested on other systems yet. ## Step 1. Create a Conda env We recommend using a **Conda** virtual environment. This project supports **Python 3.9** (recommended) and **Python 3.10**. **WARNING: Python 3.11 and later versions are not compatible!** ``` conda create -n FedDGCN python=3.9 conda activate FedDGCN ``` ## Step 2. Install Pytorch Download the appropriate version of [PyTorch]( https://pytorch.org/get-started/locally/) based on your device. This project has been tested with **Torch 2.4.0 (recommended)** and **Torch 2.0.0** with **CUDA 12**. Compatibility with other versions is not guaranteed. ## Step 3. Install FederatedScope git clone this repository, and ``` cd FS-TFP pip install -e . ``` Additionally, you might need to install some extra packages to avoid annoying warnings. ``` pip install torch_geometric community rdkit ``` # 2. Run the Code ## Step 1. Prepare the datasets You need to download the PeMS dataset from the **[STSGCN](https://github.com/Davidham3/STSGCN)** repository following README. After downloading, extract the dataset and place it in the `./data/trafficflow` directory at the root of the project. The directory structure of `./data/trafficflow` should be as follows: ``` FS-TFP\DATA\TRAFFICFLOW ├─PeMS03 ├─PeMS04 ├─PeMS07 └─PeMS08 ``` ## Step 2. Check your Setting We have placed the run scripts for the four datasets in the `./scripts/trafficflow_exp_scripts/` directory. There are YAML files for four datasets: `{D3, D4, D7, D8}.yaml`. You can customize the parameters or use the presets we provide. Some key parameters include: ``` # Line 3: Adjust the GPU device to use (for multi-GPU machines) device: 0 # Line 8: Adjust the total number of training rounds total_round_num: # Line 9: Adjust the number of clients based on your machine configuration client_num: # Line 65: Adjust the training loss function # Options: L1Loss, RMSE, MAPE criterion: type: ``` **WARNING:** Processing the **PEMSD7** dataset may require more than **32GB** RAM. If your system lacks sufficient RAM, it is recommended to increase the size of the swap partition. ## Step 3. Run the experiments You can use the following command to run **FedDGCN** directly. It is recommended to create the corresponding run configuration in your IDE based on the command below: ``` # PEMSD3 python federatedscope/main.py --cfg scripts/trafficflow_exp_scripts/D3.yaml # PEMSD4 python federatedscope/main.py --cfg scripts/trafficflow_exp_scripts/D4.yaml # PEMSD7 python federatedscope/main.py --cfg scripts/trafficflow_exp_scripts/D7.yaml # PEMSD8 python federatedscope/main.py --cfg scripts/trafficflow_exp_scripts/D8.yaml ``` If you see the following output in your terminal, congratulations! You have successfully run the experiment: ![image-20241121193843250](./figures/exp.png) # 3. Visualize the result The experiment logs will be placed in the **exp** folder. We have written a script, **global.py**, in the **exp** folder. You need to replace the previous logs with the new ones generated from the experiment. Once replaced, simply run the script to visualize the experiment results. ``` python exp/global.py ``` The script will generate a **baseline.jpg** file to visualize the logs. You are also free to modify the script to implement additional functionality as needed. You may install matplotlib first for drawing: ``` pip install matplotlib ``` # Citation TBD