diff --git a/README.md b/README.md index e5b54f7a6789a5d5176cdb0382fb6733c9a726ef..624bfbd2b587d6f8d177d8ac781079e75961207c 100644 --- a/README.md +++ b/README.md @@ -9,6 +9,8 @@ The `.sh` files are scripts meant to be executed on lara via slurm. They include The two python notebooks deal with dataset preparation. `stories-scraping.ipynb` scrapes prompts (inputs) and stories (outputs) from a [blog](https://blog.reedsy.com/short-stories/science-fiction/) where weekly writing contests are held, and saves them into a temporary `raw_stories` directory. The `stories-analysis.ipynb` performs statistical analysis on the scraped data and prepares the data for training - separates words and punctuation, shortens the stories to a desired length (1800 words) and separates the data into train, valid and testing datasets. It also deletes temporary directories. +Note: Trained model files are available on igor in my directory. (Link in report Appendices) + ### Model Reference