This program was created for the "Team project - development of data analysis system" course run by @pbiecek at Warsaw University of Technology.
The program downloads tweets from Twitter for a specified user or keyword. Downloading can be done in two modes:
- stream
- history
The full specification and a more detailed description of the summarization features (in Polish) can be found in this file.
- Download the binary file from the latest release
- Prepare `config.yaml`
- Run the binaries `twitter-saver` and `web`
The programs can be run from the command line with the following arguments:
- `--config` - path to the config file
The config file is stored in YAML format. It contains the following information:
- `db`:
  - `host`
  - `port`
  - `user`
  - `password`
  - `dbName` - database name in which data will be stored
  - `sslMode` - `enable` or `disable`
- `web` - web interface parameters:
  - `port` - port on which the server will be working
  - `secret` - private key used to sign session identifiers. Should be random and renewed periodically. Keys shorter than 256 bits are not recommended.
- `users` - sequence of user accounts. Each account consists of:
  - `username`
  - `password`
- `twitter` - Twitter API keys:
  - `consumerKey`
  - `consumerSecret`
  - `token`
  - `tokenSecret`
- `json` - defines additional fields from the tweet saved in the database:
  - `all` - saves whole tweet content
  - `fields` - sequence of field names to be stored
- `autoDeleteDays` - number of days after which data will be automatically removed
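
As a rough illustration of the structure described above, a `config.yaml` might look like the sketch below. The key names come from the list above. Everything else is an assumption made for illustration: every value is a placeholder, the nesting (for example `users` and `autoDeleteDays` as top-level keys) is inferred from the list, `all` is assumed to be a boolean, and the entries under `fields` are example names only.

```yaml
# Illustrative sketch only: values are placeholders, nesting is assumed.
db:
  host: localhost
  port: 5432                  # placeholder; use the port of your database
  user: twitter
  password: change-me
  dbName: tweets              # database in which data will be stored
  sslMode: disable            # enable or disable
web:
  port: 8080                  # placeholder port for the web interface
  secret: replace-with-a-random-key-of-at-least-256-bits
users:
  - username: admin           # placeholder account
    password: change-me
twitter:
  consumerKey: YOUR_CONSUMER_KEY
  consumerSecret: YOUR_CONSUMER_SECRET
  token: YOUR_TOKEN
  tokenSecret: YOUR_TOKEN_SECRET
json:
  all: false                  # assumed boolean; true would save the whole tweet
  fields:                     # example field names, not a required set
    - lang
    - retweet_count
autoDeleteDays: 30            # placeholder retention period
```

With such a file in place, each binary would presumably be started with the `--config` argument pointing at it, for example `twitter-saver --config config.yaml` (the path is a placeholder).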
- Piotr Krzeszewski
- Łukasz Ławniczak
- Artur Minorczyk
