# Setting up PeopleMap

## Preparing your Dataset

* Now that you have completed the Getting Started section, it is time to begin gathering the data for your analysis.
* Using a spreadsheet or **.csv** editor, create a table with a single column titled **URL**
* Fill the column with the Google Scholar profile URLs of the researchers you'd like to include in your dataset, one URL per row
  * You can find each researcher's profile through the Google Scholar search: <https://scholar.google.com/>
* Once the .csv contains every URL you'd like to include, save it to the root directory of the project and name the file **ResearchersDataset**
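
If you prefer the terminal to a spreadsheet editor, the same file can be sketched with a short shell snippet. This is a minimal sketch, not part of PeopleMap itself: the `.csv` extension on the file name is an assumption (this guide only gives the name **ResearchersDataset**), and the two `EXAMPLE_ID` URLs are placeholders to be replaced with real profile URLs.

```shell
# Run from the root directory of the project.
# ASSUMPTION: the file carries a .csv extension; adjust if your setup differs.
dataset="ResearchersDataset.csv"

# Header row "URL", then one Google Scholar profile URL per line.
# The EXAMPLE_ID values are placeholders, not real profiles.
cat > "$dataset" <<'EOF'
URL
https://scholar.google.com/citations?user=EXAMPLE_ID_1
https://scholar.google.com/citations?user=EXAMPLE_ID_2
EOF

# Quick check: read the header and count the rows that look like profile URLs.
header=$(head -n 1 "$dataset")
rows=$(grep -c '^https://scholar\.google\.com/citations?user=' "$dataset")
echo "header: $header, profile rows: $rows"
```

If the header is not exactly `URL`, or the row count is lower than the number of researchers you entered, recheck the file before moving on.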

## Running the Bash File

* Next, run the bash file that starts the data-gathering and processing steps
* In a terminal, navigate to the root directory of the project
* After that, run the following command to begin the process:

```
./dataprocess.sh
```

* This will begin the process of:
  * Collecting author information related to their profile and publications on Google Scholar
  * Cleaning and preparing the data for analysis
  * Running natural language processing algorithms to extract information from the researcher dataset
* The process takes roughly one hour for every 10 researchers in the dataset
* Keep your computer connected to the Internet until the process completes
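
To turn the rule of thumb above into a number for your own dataset, the arithmetic is about six minutes per researcher (one hour divided by ten). A minimal sketch follows. `estimate_minutes` is a hypothetical helper written for this guide, not something PeopleMap provides, and the result is only as accurate as the rough one-hour figure.

```shell
# estimate_minutes N: rough number of minutes for N researchers,
# based on the guide's figure of about 1 hour per 10 researchers
# (6 minutes each). Hypothetical helper, not part of PeopleMap.
estimate_minutes() {
  echo $(( $1 * 6 ))
}

# Example: a dataset with 25 researchers.
minutes=$(estimate_minutes 25)
echo "about $(( minutes / 60 )) h $(( minutes % 60 )) min"
```

For 25 researchers this gives 150 minutes, or about two and a half hours.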

## Navigating to the PeopleMap platform

* Once the process finishes, the output will include a localhost link, usually in the following form:

  ```
  - Local:      http://localhost:5000
  ```
* Copy the URL (of the form `http://localhost:5000`) and paste it into a web browser
  * Most web browsers should work, but Google Chrome and Firefox tend to work best for the platform
* Now that you have the platform fully set up, feel free to begin analyzing and exploring the data!
  * To learn more about any of the tools, scroll down to the "What is PeopleMap" section of the webpage
* Need help debugging? Navigate to the FAQ page below:
