mirror of
				https://github.com/simon987/od-database.git
				synced 2025-10-25 11:56:51 +00:00 
			
		
		
		
	OD-Database
Suggestions/concerns/PRs are welcome
Installation
Assuming you have Python 3 and git installed:
git clone https://github.com/simon987/od-database
cd od-database
sudo pip3 install -r requirements.txt
Create /config.py and fill out the parameters. Sample config:
# Leave default values for no CAPTCHAs
CAPTCHA_LOGIN = False
CAPTCHA_SUBMIT = False
CAPTCHA_SITE_KEY = ""
CAPTCHA_SECRET_KEY = ""
# Flask secret key for sessions
FLASK_SECRET = ""
RESULTS_PER_PAGE = (25, 50, 100, 250, 500, 1000)
# Headers for http crawler
HEADERS = {}
# Number of crawler instances (one per task)
CRAWL_SERVER_PROCESSES = 3
# Number of threads per crawler instance
CRAWL_SERVER_THREADS = 20
# Allow ftp websites in /submit
SUBMIT_FTP = False
# Allow http(s) websites in /submit
SUBMIT_HTTP = True
SERVER_URL = "http://localhost/api"
API_TOKEN = "5817926d-f2f9-4422-a411-a98f1bfe4b6c"
Running the crawl server
cd od-database
export PYTHONPATH=$(pwd)
cd crawl_server
python3 run.py
Running the web server (development)
cd od-database
python3 app.py
Running the web server with nginx (production)
- Install dependencies:
sudo apt install build-essential python-dev
sudo pip install uwsgi
- Adjust the path in od-database.ini
- Configure nginx (on Debian 9: /etc/nginx/sites-enabled/default):
server {
        ...
        include uwsgi_params;
        location / {
                uwsgi_pass 127.0.0.1:3031;
        }
        
        ...
}
- Start uwsgi:
uwsgi od-database.ini
Description
				
					Languages
				
				
								
								
									Python
								
								43%
							
						
							
								
								
									CSS
								
								29.4%
							
						
							
								
								
									HTML
								
								19.2%
							
						
							
								
								
									JavaScript
								
								8.3%