ssalb committed
Commit 139df7a · Parent: 7c0d92c

Update space with latest code and dependencies on Mon Jan 13 19:33:51 UTC 2025

README.md CHANGED
@@ -9,7 +9,7 @@ sdk_version: 5.9.1
  app_file: app.py
  pinned: false
  preload_from_hub:
- - openai-community/gpt2
+ - HuggingFaceTB/SmolLM2-135M-Instruct
  - answerdotai/ModernBERT-base
  - facebook/bart-large-mnli
  license: mit
@@ -17,8 +17,8 @@ license: mit

  ## Project Overview

- The Story Generator project leverages advanced natural language processing models to generate coherent and engaging stories. By utilizing models such as GPT-2, BERT, and BART, this project aims to provide users with a tool to create narratives based on given prompts. The application is built using Gradio for an interactive user interface, making it easy to input prompts and receive generated stories in real time.
+ This Story Generator leverages natural language processing models to generate coherent and engaging stories. By utilizing models such as SmolLM2, BERT, and BART, this project aims to provide users with a tool to create narratives based on given prompts. The application is built using Gradio for an interactive user interface, making it easy to input prompts and receive generated stories in real time.

- The main purpose of this project is to explore the idea of beam search for selecting stories with high coherence, fluency, and genre alignment scores. This ensures that the generated stories are not only creative but also maintain a logical flow and adhere to the specified genre.
+ The main purpose of this project is to explore the idea of beam search for selecting stories with high coherence, fluency, and genre alignment scores, in a process-based reward model (PRM) fashion. This ensures that the generated stories are not only creative but also maintain a logical flow and adhere to the specified genre.

  Note that the final implementation is not strictly beam search and was modified to allow more diversity (creativity), inspired by the DVTS method in [this blog post](https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute).
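The overview above describes ranking candidate continuations by coherence, fluency, and genre-alignment scores at each step. Below is a minimal sketch of that kind of weighted selection; the class, the weights, and the `beam_width` default are illustrative assumptions, not the Space's actual code.

```python
from dataclasses import dataclass


@dataclass
class ScoredStory:
    text: str
    coherence: float  # e.g. embedding similarity between consecutive chunks
    fluency: float    # e.g. normalized language-model likelihood
    genre: float      # e.g. zero-shot entailment score for the requested genre


def select_candidates(
    candidates: list[ScoredStory],
    beam_width: int = 3,
    weights: tuple[float, float, float] = (0.4, 0.3, 0.3),
) -> list[ScoredStory]:
    """Keep the top-`beam_width` candidates by a weighted sum of the three scores."""
    w_coh, w_flu, w_gen = weights
    ranked = sorted(
        candidates,
        key=lambda s: w_coh * s.coherence + w_flu * s.fluency + w_gen * s.genre,
        reverse=True,
    )
    return ranked[:beam_width]
```

In a DVTS-style variant, each surviving candidate would seed several new continuations before the next scoring round, trading some score optimality for diversity.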
requirements.txt CHANGED
@@ -1,7 +1,7 @@
  accelerate==1.2.1 ; python_full_version == "3.10.13"
  aiofiles==23.2.1 ; python_full_version == "3.10.13"
  annotated-types==0.7.0 ; python_full_version == "3.10.13"
- anyio==4.7.0 ; python_full_version == "3.10.13"
+ anyio==4.8.0 ; python_full_version == "3.10.13"
  certifi==2024.12.14 ; python_full_version == "3.10.13"
  charset-normalizer==3.4.1 ; python_full_version == "3.10.13"
  click==8.1.8 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
@@ -16,7 +16,7 @@ gradio==5.9.1 ; python_full_version == "3.10.13"
  h11==0.14.0 ; python_full_version == "3.10.13"
  httpcore==1.0.7 ; python_full_version == "3.10.13"
  httpx==0.28.1 ; python_full_version == "3.10.13"
- huggingface-hub==0.27.0 ; python_full_version == "3.10.13"
+ huggingface-hub==0.27.1 ; python_full_version == "3.10.13"
  idna==3.10 ; python_full_version == "3.10.13"
  jinja2==3.1.5 ; python_full_version == "3.10.13"
  joblib==1.4.2 ; python_full_version == "3.10.13"
@@ -26,16 +26,16 @@ mdurl==0.1.2 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
  mpmath==1.3.0 ; python_full_version == "3.10.13"
  networkx==3.4.2 ; python_full_version == "3.10.13"
  numpy==2.2.1 ; python_full_version == "3.10.13"
- orjson==3.10.13 ; python_full_version == "3.10.13"
+ orjson==3.10.14 ; python_full_version == "3.10.13"
  packaging==24.2 ; python_full_version == "3.10.13"
  pandas==2.2.3 ; python_full_version == "3.10.13"
  pillow==11.1.0 ; python_full_version == "3.10.13"
- protobuf==5.29.2 ; python_full_version == "3.10.13"
+ protobuf==5.29.3 ; python_full_version == "3.10.13"
  psutil==6.1.1 ; python_full_version == "3.10.13"
  pydantic-core==2.27.2 ; python_full_version == "3.10.13"
- pydantic==2.10.4 ; python_full_version == "3.10.13"
+ pydantic==2.10.5 ; python_full_version == "3.10.13"
  pydub==0.25.1 ; python_full_version == "3.10.13"
- pygments==2.18.0 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
+ pygments==2.19.1 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
  python-dateutil==2.9.0.post0 ; python_full_version == "3.10.13"
  python-multipart==0.0.20 ; python_full_version == "3.10.13"
  pytz==2024.2 ; python_full_version == "3.10.13"
@@ -43,11 +43,11 @@ pyyaml==6.0.2 ; python_full_version == "3.10.13"
  regex==2024.11.6 ; python_full_version == "3.10.13"
  requests==2.32.3 ; python_full_version == "3.10.13"
  rich==13.9.4 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
- ruff==0.8.5 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
+ ruff==0.9.1 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
  safehttpx==0.1.6 ; python_full_version == "3.10.13"
- safetensors==0.5.0 ; python_full_version == "3.10.13"
- scikit-learn==1.6.0 ; python_full_version == "3.10.13"
- scipy==1.15.0 ; python_full_version == "3.10.13"
+ safetensors==0.5.2 ; python_full_version == "3.10.13"
+ scikit-learn==1.6.1 ; python_full_version == "3.10.13"
+ scipy==1.15.1 ; python_full_version == "3.10.13"
  semantic-version==2.10.0 ; python_full_version == "3.10.13"
  shellingham==1.5.4 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
  six==1.17.0 ; python_full_version == "3.10.13"
@@ -59,7 +59,7 @@ tokenizers==0.21.0 ; python_full_version == "3.10.13"
  tomlkit==0.13.2 ; python_full_version == "3.10.13"
  torch==2.4.0 ; python_full_version == "3.10.13"
  tqdm==4.67.1 ; python_full_version == "3.10.13"
- transformers @ git+https://github.com/huggingface/transformers.git@e5fd865ebae062b7cf03a81b8c6affeb39f30bec ; python_full_version == "3.10.13"
+ transformers==4.48.0 ; python_full_version == "3.10.13"
  triton==3.0.0 ; platform_system == "Linux" and platform_machine == "x86_64" and python_full_version == "3.10.13"
  typer==0.15.1 ; sys_platform != "emscripten" and python_full_version == "3.10.13"
  typing-extensions==4.12.2 ; python_full_version == "3.10.13"
story_beam_search/scoring.py CHANGED
@@ -71,7 +71,7 @@ class CoherenceScorer(StoryScorer):
          outputs = self.model(**inputs, output_hidden_states=True)
          batch_embeddings = outputs.hidden_states[-1][
              :, 0, :
-         ]  # Get CLS token embeddings
+         ]
          all_embeddings.extend(batch_embeddings.cpu().numpy())

      # Calculate coherence scores for each story
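For context, the `[:, 0, :]` slice keeps the final-layer hidden state of the first ([CLS]) token as a per-chunk embedding. A plausible way to turn those embeddings into a coherence score (an illustrative sketch, not necessarily the repository's exact logic) is the average cosine similarity between consecutive chunks:

```python
import numpy as np


def coherence_from_embeddings(embeddings: np.ndarray) -> float:
    """Average cosine similarity between consecutive chunk embeddings.

    `embeddings` has shape (num_chunks, hidden_dim); higher values mean
    adjacent chunks stay semantically close, i.e. the story reads as coherent.
    """
    if len(embeddings) < 2:
        return 1.0  # a single chunk is trivially coherent
    sims = []
    for a, b in zip(embeddings[:-1], embeddings[1:]):
        denom = np.linalg.norm(a) * np.linalg.norm(b) + 1e-8
        sims.append(float(np.dot(a, b) / denom))
    return float(np.mean(sims))
```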
story_beam_search/stories_generator.py CHANGED
@@ -16,7 +16,7 @@ auth_token = os.getenv("HF_TOKEN", None)

  @dataclass
  class ModelConfig:
-     text_model_name: str = "openai-community/gpt2"
+     text_model_name: str = "HuggingFaceTB/SmolLM2-135M-Instruct"
      bert_name: str = "answerdotai/ModernBERT-base"
      zero_shot_name: str = "facebook/bart-large-mnli"
      device: str = (
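The config change swaps the generator from GPT-2 to the instruction-tuned SmolLM2-135M while keeping the ModernBERT and BART checkpoints. A hedged sketch of how such a `ModelConfig` is typically consumed with `transformers` (the loader below is illustrative, not taken from the repository):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline


def load_models(config: ModelConfig, auth_token: str | None = None):
    """Load the generator and the zero-shot genre classifier named in the config."""
    tokenizer = AutoTokenizer.from_pretrained(config.text_model_name, token=auth_token)
    generator = AutoModelForCausalLM.from_pretrained(
        config.text_model_name,
        token=auth_token,
    ).to(config.device)
    # facebook/bart-large-mnli served through the zero-shot classification pipeline
    genre_classifier = pipeline(
        "zero-shot-classification",
        model=config.zero_shot_name,
        device=0 if config.device == "cuda" else -1,
    )
    return tokenizer, generator, genre_classifier
```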