Merge branch 'main' into single.poetry

2024-11-08 07:11:06 +00:00 · 2024-02-15 21:23:30 -03:00 · 2024-02-15 21:23:30 -03:00 · f0255d2d6e
commit f0255d2d6e
parent bd913c626b 58e6e277a6
6 changed files with 145 additions and 8 deletions
--- a/README.md
+++ b/README.md
@ -23,9 +23,6 @@

 </div>

-> [!NOTE]  
-> We are currently working on some client installation optimizations. If the instructions don't work, check back in a day or two and it should be sorted, and be even easier than before.
-
 ## Navigation

 - [What and Why](#what-and-why)
@ -435,6 +432,8 @@ The content features a conversation between two individuals discussing various t
 - _Caleb Sima_ for pushing me over the edge of whether to make this a public project or not.
 - _Joel Parish_ for super useful input on the project's Github directory structure.
 - _Jonathan Dunn_ for spectacular work on the soon-to-be-released universal client.
+- _Joseph Thacker_ for the idea of a `-c` context flag that adds pre-created context in the `./config/fabric/` directory to all Pattern queries.
+- _Jason Haddix_ for the idea of a stitch (chained Pattern) to filter content using a local model before sending on to a cloud model, i.e., cleaning customer data using `llama2` before sending on to `gpt-4` for analysis.

 ### Primary contributors

--- a/helpers/vm
+++ b/helpers/vm
@ -0,0 +1,86 @@
+#!/usr/bin/env python3
+
+import sys
+import re
+from googleapiclient.discovery import build
+from googleapiclient.errors import HttpError
+from youtube_transcript_api import YouTubeTranscriptApi
+from dotenv import load_dotenv
+import os
+import json
+import isodate
+import argparse
+
+def get_video_id(url):
+    # Extract video ID from URL
+    pattern = r'(?:https?:\/\/)?(?:www\.)?(?:youtube\.com\/(?:[^\/\n\s]+\/\S+\/|(?:v|e(?:mbed)?)\/|\S*?[?&]v=)|youtu\.be\/)([a-zA-Z0-9_-]{11})'
+    match = re.search(pattern, url)
+    return match.group(1) if match else None
+
+def main(url, options):
+    # Load environment variables from .env file
+    load_dotenv(os.path.expanduser('~/.config/fabric/.env'))
+
+    # Get YouTube API key from environment variable
+    api_key = os.getenv('YOUTUBE_API_KEY')
+    if not api_key:
+        print("Error: YOUTUBE_API_KEY not found in ~/.config/fabric/.env")
+        return
+
+    # Extract video ID from URL
+    video_id = get_video_id(url)
+    if not video_id:
+        print("Invalid YouTube URL")
+        return
+
+    try:
+        # Initialize the YouTube API client
+        youtube = build('youtube', 'v3', developerKey=api_key)
+
+        # Get video details
+        video_response = youtube.videos().list(
+            id=video_id,
+            part='contentDetails'
+        ).execute()
+
+        # Extract video duration and convert to minutes
+        duration_iso = video_response['items'][0]['contentDetails']['duration']
+        duration_seconds = isodate.parse_duration(duration_iso).total_seconds()
+        duration_minutes = round(duration_seconds / 60)
+
+        # Get video transcript
+        try:
+            transcript_list = YouTubeTranscriptApi.get_transcript(video_id)
+            transcript_text = ' '.join([item['text'] for item in transcript_list])
+            transcript_text = transcript_text.replace('\n', ' ')
+        except Exception as e:
+            transcript_text = "Transcript not available."
+
+        # Output based on options
+        if options.duration:
+            print(duration_minutes)
+        elif options.transcript:
+            print(transcript_text)
+        else:
+            # Create JSON object
+            output = {
+                "transcript": transcript_text,
+                "duration": duration_minutes
+            }
+            # Print JSON object
+            print(json.dumps(output))
+    except HttpError as e:
+        print("Error: Failed to access YouTube API. Please check your YOUTUBE_API_KEY and ensure it is valid.")
+
+if __name__ == '__main__':
+    parser = argparse.ArgumentParser(description='vm (video meta) extracts metadata about a video, such as the transcript and the video\'s duration. By Daniel Miessler.')
+    parser.add_argument('url', nargs='?', help='YouTube video URL')
+    parser.add_argument('--duration', action='store_true', help='Output only the duration')
+    parser.add_argument('--transcript', action='store_true', help='Output only the transcript')
+    args = parser.parse_args()
+
+    if args.url:
+        main(args.url, args)
+    else:
+        parser.print_help()
+
--- a/patterns/analyze_prose_json/system.md
+++ b/patterns/analyze_prose_json/system.md
@ -30,24 +30,28 @@ Common examples that meet this criteria:

 - Minor expansion on existing ideas, but in a way that's useful.

-"C - Incremental" -- Useful expansion or improvement of existing ideas, or a useful description of the past, but no expansion or creation of new ideas. Imagine a novelty score between 50% and 80% for this tier.
+"C - Incremental" -- Useful expansion or significant improvement of existing ideas, or a somewhat insightful description of the past, but no expansion on, or creation of, new ideas. Imagine a novelty score between 50% and 80% for this tier.

 Common examples that meet this criteria:

- Valuable collections of resources
- Descriptions of the past with offered observations and takeaways
+- Useful collections of resources.
+- Descriptions of the past with offered observations and takeaways.
+- Minor expansions on existing ideas.

 "D - Derivative" -- Largely derivative of well-known ideas. Imagine a novelty score between in the 20% to 50% range for this tier.

 Common examples that meet this criteria:

- Contains ideas or facts, but they're not new in any way.
+- Restatement of common knowledge or best practices.
+- Rehashes of well-known ideas without any new takes or expansions of ideas.
+- Contains ideas or facts, but they're not new or improved in any significant way.

 "F - Stale" -- No new ideas whatsoever. Imagine a novelty score below 20% for this tier.

 Common examples that meet this criteria:

- Random ramblings that say nothing new.
+- Completely trite and unoriginal ideas.
+- Heavily cliche or standard ideas.

 4. Evaluate the CLARITY of the writing on the following scale.

--- a/patterns/rate_value/README.md
+++ b/patterns/rate_value/README.md
@ -0,0 +1,3 @@
+# Credit
+
+Co-created by Daniel Miessler and Jason Haddix based on influences from Claude Shannon's Information Theory and Mr. Beast's insanely viral content techniques.
--- a/patterns/rate_value/system.md
+++ b/patterns/rate_value/system.md
@ -0,0 +1,45 @@
+# IDENTITY and PURPOSE
+
+You are an expert parser and rater of value in content. Your goal is to determine how much value a reader/listener is being provided in a given piece of content as measured by a new metric called Value Per Minute (VPM).
+
+Take a deep breath and think step-by-step about how best to achieve the best outcome using the STEPS below.
+
+# STEPS
+
+- Fully read and understand the content and what it's trying to communicate and accomplish.
+
+- Estimate the duration of the content if it were to be consumed naturally, using the algorithm below:
+
+1. Count the total number of words in the provided transcript.
+2. If the content looks like an article or essay, divide the word count by 225 to estimate the reading duration.
+3. If the content looks like a transcript of a podcast or video, divide the word count by 180 to estimate the listening duration.
+4. Round the calculated duration to the nearest minute.
+5. Store that value as estimated-content-minutes.
+
+- Extract all Instances Of Value being provided within the content. Instances Of Value are defined as:
+
+-- Highly surprising ideas or revelations.
+-- A giveaway of something useful or valuable to the audience.
+-- Untold and interesting stories with valuable takeaways.
+-- Sharing of an uncommonly valuable resource.
+-- Sharing of secret knowledge.
+-- Exclusive content that's never been revealed before.
+-- Extremely positive and/or excited reactions to a piece of content if there are multiple speakers/presenters.
+
+- Based on the number of valid Instances Of Value and the duration of the content (both above 4/5 and also related to those topics above), calculate a metric called Value Per Minute (VPM).
+
+# OUTPUT INSTRUCTIONS
+
+- Output a valid JSON file with the following fields for the input provided.
+
+{
+    estimated-content-minutes: "(estimated-content-minutes)";
+    value-instances: "(list of valid value instances)",
+    vpm: "(the calculated VPS score.)",
+    vpm-explanation: "(A one-sentence summary of less than 20 words on how you calculated the VPM for the content.)",
+}
+
+
+# INPUT:
+
+INPUT:
--- a/patterns/rate_value/user.md
+++ b/patterns/rate_value/user.md