Local Setup
This page describes how to prepare a local environment for the public VersoVector repository.
Clone the repository
git clone https://github.com/HubertRonald/VersoVector.git
cd VersoVector
Python version
The project was developed with Python 3.10.
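To see which interpreter a shell would pick up before creating the environment, a quick check along these lines may help. It is only a sketch: it assumes `python3` is the command on your PATH, and the variable name `version` is purely illustrative.

```shell
# Read the major.minor version of the python3 found on PATH.
# Falls back to "unknown" if python3 is not installed (assumption: the command is named python3).
version="$(python3 -c 'import sys; print("%d.%d" % sys.version_info[:2])' 2>/dev/null || echo unknown)"

if [ "$version" = "3.10" ]; then
  echo "OK: python3 is Python $version"
else
  echo "Note: python3 reports '$version'; the project was developed with Python 3.10"
fi
```

A different version does not necessarily mean the project will fail, only that it differs from the version the project was developed with.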
A recommended setup is:
python3.10 -m venv .venv
source .venv/bin/activate
On Windows:
py -3.10 -m venv .venv
.\.venv\Scripts\Activate.ps1
Install dependencies
Upgrade packaging tools:
python -m pip install --upgrade pip setuptools wheel
Install base dependencies:
pip install -r requirements.txt
For notebooks and development tools:
pip install -r requirements-dev.txt
spaCy model
The preprocessing pipeline uses spaCy. Install the required English model:
python -m spacy download en_core_web_lg
Optional UMAP dependencies
UMAP is optional because it may require additional compiled dependencies in some local environments.
pip install -r requirements-umap.txt
If UMAP installation fails, the project can still be inspected through the remaining notebooks and t-SNE/projection alternatives where available.
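To find out whether the optional install succeeded, one possible check is to try importing the package from the active environment. This sketch assumes `requirements-umap.txt` installs the `umap-learn` distribution, which is imported as `umap`. The variable name `umap_status` is illustrative.

```shell
# Try to import UMAP with the interpreter from the active environment.
# Assumption: the optional requirements install umap-learn (import name: umap).
if python -c "import umap" >/dev/null 2>&1; then
  umap_status="available"
else
  umap_status="missing"
fi

echo "UMAP: $umap_status"
```

Importing the package, instead of only checking that it is installed, also exercises its compiled dependencies, which is where local installs tend to break.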
Dataset requirement
Before running the notebooks, download the main dataset and place it at:
data/PoetryFoundationData.csv
See Dataset for download options and expected file layout.
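A small pre-flight check can confirm the file is in place before a notebook is opened. The sketch below assumes it is run from the repository root. The helper name `check_dataset` is hypothetical and not part of the project.

```shell
# Hypothetical helper: report whether a dataset file exists at the given path.
check_dataset() {
  if [ -f "$1" ]; then
    echo "found: $1"
  else
    echo "missing: $1" >&2
    return 1
  fi
}

# Run from the repository root; "|| true" keeps the shell going if the file is absent.
check_dataset "data/PoetryFoundationData.csv" || true
```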
Jupyter kernel
Optional but recommended:
python -m ipykernel install --user \
--name versovector-py310 \
--display-name "Python 3.10.11 (VersoVector)"
Clean generated files
Generated cache and heavy artifacts should not be committed.
find . -type d -name "__pycache__" -exec rm -rf {} +
Generated model artifacts are expected to be rebuilt locally instead of versioned in Git.
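Because the cleanup command above deletes directories recursively, it can be worth previewing what it would match first. The following is a minimal dry-run sketch. The function name `list_pycache` is hypothetical.

```shell
# Hypothetical helper: list __pycache__ directories under a root without deleting them.
# -prune stops find from descending into each match; -print only prints the path.
list_pycache() {
  find "${1:-.}" -type d -name "__pycache__" -prune -print
}

# Preview matches under the current directory.
list_pycache .
```

If the listed paths look right, the deleting variant can then be run with more confidence.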