langchain/docs/modules/indexes/document_loaders/examples/imsdb.ipynb

95 lines
333 KiB
Plaintext
Raw Normal View History

2023-02-09 15:52:50 +00:00
{
"cells": [
{
"cell_type": "markdown",
"id": "cc9b809c",
"metadata": {},
"source": [
"# IMSDb\n",
"\n",
"This covers how to load IMSDb webpages into a document format that we can use downstream."
]
},
{
"cell_type": "code",
"execution_count": 1,
"id": "9d1f867e",
"metadata": {},
"outputs": [],
"source": [
"from langchain.document_loaders import IMSDbLoader"
]
},
{
"cell_type": "code",
"execution_count": 2,
"id": "84a32aa1",
"metadata": {},
"outputs": [],
"source": [
"loader = IMSDbLoader(\"https://imsdb.com/scripts/BlacKkKlansman.html\")"
]
},
{
"cell_type": "code",
"execution_count": 3,
"id": "8ae5ffe2",
"metadata": {},
"outputs": [],
"source": [
"data = loader.load()"
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "d41da111",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[Document(page_content='\\n\\r\\n\\r\\n\\r\\n\\r\\n BLACKKKLANSMAN\\r\\n \\r\\n \\r\\n \\r\\n \\r\\n Written by\\r\\n\\r\\n Charlie Wachtel & David Rabinowitz\\r\\n\\r\\n and\\r\\n\\r\\n Kevin Willmott & Spike Lee\\r\\n\\r\\n\\r\\n\\r\\n\\r\\n\\r\\n\\r\\n\\r\\n\\r\\n FADE IN:\\r\\n \\r\\n SCENE FROM \"GONE WITH THE WIND\"\\r\\n \\r\\n Scarlett O\\'Hara, played by Vivian Leigh, walks through the\\r\\n Thousands of injured Confederate Soldiers pulling back to\\r\\n reveal the Famous Shot of the tattered Confederate Flag in\\r\\n \"Gone with the Wind\" as The Max Stein Music Score swells from\\r\\n Dixie to Taps.\\r\\n \\r\\n BEAUREGARD- KLAN NARRATOR (O.S.)\\r\\n They say they may have lost the\\r\\n Battle but they didn\\'t lose The War.\\r\\n Yes, Friends, We are under attack.\\r\\n \\r\\n CUT TO:\\r\\n \\r\\n A 1960\\'S EDUCATIONAL STYLE FILM\\r\\n \\r\\n Shot on Grainy COLOR 16MM EKTACHROME Film, The NARRATOR\\r\\n BEAUREGARD, a Middle Aged but handsome, White Male, sits at a\\r\\n desk, a Confederate Flag on a stand beside him. Very\\r\\n Official. He is not a Southerner and speaks with articulation\\r\\n and intelligence.\\r\\n \\r\\n BEAUREGARD- KLAN NARRATOR\\r\\n You\\'ve read about it in your Local\\r\\n Newspapers or seen it on The Evening\\r\\n News. That\\'s right. We\\'re living in\\r\\n an Era marked by the spread of\\r\\n Integration and Miscegenation.\\r\\n \\r\\n CUT TO:\\r\\n \\r\\n FOOTAGE OF THE LITTLE ROCK NINE\\r\\n \\r\\n being escorted into CENTRAL HIGH SCHOOL, Little Rock,\\r\\n Arkansas by The National Guard.\\r\\n \\r\\n BEAUREGARD- KLAN NARRATOR\\r\\n (V.O.)(CONT\\'D)\\r\\n The Brown Decision forced upon us by\\r\\n The Jewish controlled Puppets on the\\r\\n U.S. Supreme Court compelling White\\r\\n children to go to School with an\\r\\n Inferior Race is The Final Nail in a\\r\\n Black Coffin towards America becoming\\r\\n a Mongrel Nation.\\r\\n \\r\\n A QUICK SERIES OF IMAGES\\r\\n \\r\\n Segregation Signs. Antebellum Photos. Happy Slaves in Old\\r\\n Movies. Masters inspecting their Cotton and Tobacco with\\r\\n their Slaves in The Fields. Blacks shining Shoes and working\\r\\n as Butlers, Porters and Maids.\\r\\n BEAUREGARD- KLAN NARRATOR (V.O.)\\r\\n (CONT\\'D)\\r\\n We had a great way of Life before The\\r\\n Martin Luther Coon\\'s of The World...\\r\\n \\r\\n CUT TO:\\r\\n \\r\\n The Billboard of Dr. Martin Luther King Jr. sitting in the\\r\\n front row o
]
},
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"data"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "207bc39b",
"metadata": {},
"outputs": [],
"source": []
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.8.1"
}
},
"nbformat": 4,
"nbformat_minor": 5
}