From b0fc8eae1187269145160f091997f37bd752b505 Mon Sep 17 00:00:00 2001
From: Zala Sesko <zsesk001@gold.ac.uk>
Date: Wed, 4 May 2022 11:28:20 +0000
Subject: [PATCH] =?UTF-8?q?=F0=9F=93=9D=20Update=20README.md?=
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

---
 README.md | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/README.md b/README.md
index e5b54f7..624bfbd 100644
--- a/README.md
+++ b/README.md
@@ -9,6 +9,8 @@ The `.sh` files are scripts meant to be executed on lara via slurm. They include
 
 The two python notebooks deal with dataset preparation. `stories-scraping.ipynb` scrapes prompts (inputs) and stories (outputs) from a [blog](https://blog.reedsy.com/short-stories/science-fiction/) where weekly writing contests are held, and saves them into a temporary `raw_stories` directory. The `stories-analysis.ipynb` performs statistical analysis on the scraped data and prepares the data for training - separates words and punctuation, shortens the stories to a desired length (1800 words) and separates the data into train, valid and testing datasets. It also deletes temporary directories.
 
+Note: Trained model files are available on igor in my directory. (Link in report Appendices)
+
 ### Model Reference
 
 
-- 
GitLab