mirror of
https://github.com/hwchase17/langchain
synced 2024-11-10 01:10:59 +00:00
224 lines
8.1 KiB
Plaintext
224 lines
8.1 KiB
Plaintext
{
|
||
"cells": [
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "683953b3",
|
||
"metadata": {},
|
||
"source": [
|
||
"# LanceDB\n",
|
||
"\n",
|
||
">[LanceDB](https://lancedb.com/) is an open-source database for vector-search built with persistent storage, which greatly simplifies retrevial, filtering and management of embeddings. Fully open source.\n",
|
||
"\n",
|
||
"This notebook shows how to use functionality related to the `LanceDB` vector database based on the Lance data format."
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": null,
|
||
"id": "bfcf346a",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [],
|
||
"source": [
|
||
"!pip install lancedb"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "markdown",
|
||
"id": "99134dd1-b91e-486f-8d90-534248e43b9d",
|
||
"metadata": {},
|
||
"source": [
|
||
"We want to use OpenAIEmbeddings so we have to get the OpenAI API Key. "
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 2,
|
||
"id": "a0361f5c-e6f4-45f4-b829-11680cf03cec",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [
|
||
{
|
||
"name": "stdin",
|
||
"output_type": "stream",
|
||
"text": [
|
||
"OpenAI API Key: ········\n"
|
||
]
|
||
}
|
||
],
|
||
"source": [
|
||
"import os\n",
|
||
"import getpass\n",
|
||
"\n",
|
||
"os.environ[\"OPENAI_API_KEY\"] = getpass.getpass(\"OpenAI API Key:\")"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": null,
|
||
"id": "aac9563e",
|
||
"metadata": {
|
||
"tags": []
|
||
},
|
||
"outputs": [],
|
||
"source": [
|
||
"from langchain.embeddings import OpenAIEmbeddings\n",
|
||
"from langchain.vectorstores import LanceDB"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 11,
|
||
"id": "a3c3999a",
|
||
"metadata": {},
|
||
"outputs": [],
|
||
"source": [
|
||
"from langchain.document_loaders import TextLoader\n",
|
||
"from langchain.text_splitter import CharacterTextSplitter\n",
|
||
"\n",
|
||
"loader = TextLoader(\"../../../state_of_the_union.txt\")\n",
|
||
"documents = loader.load()\n",
|
||
"\n",
|
||
"documents = CharacterTextSplitter().split_documents(documents)\n",
|
||
"\n",
|
||
"embeddings = OpenAIEmbeddings()"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 12,
|
||
"id": "6e104aee",
|
||
"metadata": {},
|
||
"outputs": [],
|
||
"source": [
|
||
"import lancedb\n",
|
||
"\n",
|
||
"db = lancedb.connect(\"/tmp/lancedb\")\n",
|
||
"table = db.create_table(\n",
|
||
" \"my_table\",\n",
|
||
" data=[\n",
|
||
" {\n",
|
||
" \"vector\": embeddings.embed_query(\"Hello World\"),\n",
|
||
" \"text\": \"Hello World\",\n",
|
||
" \"id\": \"1\",\n",
|
||
" }\n",
|
||
" ],\n",
|
||
" mode=\"overwrite\",\n",
|
||
")\n",
|
||
"\n",
|
||
"docsearch = LanceDB.from_documents(documents, embeddings, connection=table)\n",
|
||
"\n",
|
||
"query = \"What did the president say about Ketanji Brown Jackson\"\n",
|
||
"docs = docsearch.similarity_search(query)"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": 14,
|
||
"id": "9c608226",
|
||
"metadata": {},
|
||
"outputs": [
|
||
{
|
||
"name": "stdout",
|
||
"output_type": "stream",
|
||
"text": [
|
||
"They were responding to a 9-1-1 call when a man shot and killed them with a stolen gun. \n",
|
||
"\n",
|
||
"Officer Mora was 27 years old. \n",
|
||
"\n",
|
||
"Officer Rivera was 22. \n",
|
||
"\n",
|
||
"Both Dominican Americans who’d grown up on the same streets they later chose to patrol as police officers. \n",
|
||
"\n",
|
||
"I spoke with their families and told them that we are forever in debt for their sacrifice, and we will carry on their mission to restore the trust and safety every community deserves. \n",
|
||
"\n",
|
||
"I’ve worked on these issues a long time. \n",
|
||
"\n",
|
||
"I know what works: Investing in crime preventionand community police officers who’ll walk the beat, who’ll know the neighborhood, and who can restore trust and safety. \n",
|
||
"\n",
|
||
"So let’s not abandon our streets. Or choose between safety and equal justice. \n",
|
||
"\n",
|
||
"Let’s come together to protect our communities, restore trust, and hold law enforcement accountable. \n",
|
||
"\n",
|
||
"That’s why the Justice Department required body cameras, banned chokeholds, and restricted no-knock warrants for its officers. \n",
|
||
"\n",
|
||
"That’s why the American Rescue Plan provided $350 Billion that cities, states, and counties can use to hire more police and invest in proven strategies like community violence interruption—trusted messengers breaking the cycle of violence and trauma and giving young people hope. \n",
|
||
"\n",
|
||
"We should all agree: The answer is not to Defund the police. The answer is to FUND the police with the resources and training they need to protect our communities. \n",
|
||
"\n",
|
||
"I ask Democrats and Republicans alike: Pass my budget and keep our neighborhoods safe. \n",
|
||
"\n",
|
||
"And I will keep doing everything in my power to crack down on gun trafficking and ghost guns you can buy online and make at home—they have no serial numbers and can’t be traced. \n",
|
||
"\n",
|
||
"And I ask Congress to pass proven measures to reduce gun violence. Pass universal background checks. Why should anyone on a terrorist list be able to purchase a weapon? \n",
|
||
"\n",
|
||
"Ban assault weapons and high-capacity magazines. \n",
|
||
"\n",
|
||
"Repeal the liability shield that makes gun manufacturers the only industry in America that can’t be sued. \n",
|
||
"\n",
|
||
"These laws don’t infringe on the Second Amendment. They save lives. \n",
|
||
"\n",
|
||
"The most fundamental right in America is the right to vote – and to have it counted. And it’s under assault. \n",
|
||
"\n",
|
||
"In state after state, new laws have been passed, not only to suppress the vote, but to subvert entire elections. \n",
|
||
"\n",
|
||
"We cannot let this happen. \n",
|
||
"\n",
|
||
"Tonight. I call on the Senate to: Pass the Freedom to Vote Act. Pass the John Lewis Voting Rights Act. And while you’re at it, pass the Disclose Act so Americans can know who is funding our elections. \n",
|
||
"\n",
|
||
"Tonight, I’d like to honor someone who has dedicated his life to serve this country: Justice Stephen Breyer—an Army veteran, Constitutional scholar, and retiring Justice of the United States Supreme Court. Justice Breyer, thank you for your service. \n",
|
||
"\n",
|
||
"One of the most serious constitutional responsibilities a President has is nominating someone to serve on the United States Supreme Court. \n",
|
||
"\n",
|
||
"And I did that 4 days ago, when I nominated Circuit Court of Appeals Judge Ketanji Brown Jackson. One of our nation’s top legal minds, who will continue Justice Breyer’s legacy of excellence. \n",
|
||
"\n",
|
||
"A former top litigator in private practice. A former federal public defender. And from a family of public school educators and police officers. A consensus builder. Since she’s been nominated, she’s received a broad range of support—from the Fraternal Order of Police to former judges appointed by Democrats and Republicans. \n",
|
||
"\n",
|
||
"And if we are to advance liberty and justice, we need to secure the Border and fix the immigration system. \n",
|
||
"\n",
|
||
"We can do both. At our border, we’ve installed new technology like cutting-edge scanners to better detect drug smuggling. \n",
|
||
"\n",
|
||
"We’ve set up joint patrols with Mexico and Guatemala to catch more human traffickers. \n",
|
||
"\n",
|
||
"We’re putting in place dedicated immigration judges so families fleeing persecution and violence can have their cases heard faster.\n"
|
||
]
|
||
}
|
||
],
|
||
"source": [
|
||
"print(docs[0].page_content)"
|
||
]
|
||
},
|
||
{
|
||
"cell_type": "code",
|
||
"execution_count": null,
|
||
"id": "a359ed74",
|
||
"metadata": {},
|
||
"outputs": [],
|
||
"source": []
|
||
}
|
||
],
|
||
"metadata": {
|
||
"kernelspec": {
|
||
"display_name": "Python 3 (ipykernel)",
|
||
"language": "python",
|
||
"name": "python3"
|
||
},
|
||
"language_info": {
|
||
"codemirror_mode": {
|
||
"name": "ipython",
|
||
"version": 3
|
||
},
|
||
"file_extension": ".py",
|
||
"mimetype": "text/x-python",
|
||
"name": "python",
|
||
"nbconvert_exporter": "python",
|
||
"pygments_lexer": "ipython3",
|
||
"version": "3.10.6"
|
||
}
|
||
},
|
||
"nbformat": 4,
|
||
"nbformat_minor": 5
|
||
}
|