# GPT4All Chat UI

The [GPT4All Chat Client](https://gpt4all.io) lets you easily interact with any local large language model.

It is optimized to run 7-13B parameter LLMs on the CPUs of any computer running OSX/Windows/Linux.


## Plugins
GPT4All Chat Plugins allow you to expand the capabilities of local LLMs. All plugins are compatible with the
chat client's server mode.

### LocalDocs Plugin (Chat With Your Data, PrivateGPT)
LocalDocs is a GPT4All plugin that allows you to chat with your local files and data.
It allows you to utilize powerful local LLMs to chat with private data without any data leaving your computer or server.
When using LocalDocs, your LLM will cite the sources that most likely contributed to a given output. Note that even an LLM equipped with LocalDocs can hallucinate.

#### Enabling LocalDocs
1. Install the latest version of GPT4All Chat from the [GPT4All website](https://gpt4all.io).
2. Go to `Settings > LocalDocs tab`.
3. Configure a collection (folder) on your computer that contains the files your LLM should have access to. You can alter the contents of the folder/directory at any time. As you
add more files to your collection, your LLM will dynamically be able to access them.
4. Spin up a chat session with any LLM (including external ones like ChatGPT, but be warned: your data will then leave your machine!).
5. At the top right, click the database icon and select which collection you want your LLM to know about during your chat session.


#### How LocalDocs Works
LocalDocs works by maintaining an index of all data in the directory your collection is linked to. This index
consists of small chunks of each document that the LLM can receive as additional input when you ask it a question.
The general technique this plugin uses is called [Retrieval Augmented Generation](https://arxiv.org/abs/2005.11401).
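
As a rough illustration of what "small chunks of each document" can mean, the sketch below splits text into overlapping word windows. The function name, the chunk size, and the overlap are hypothetical choices made for this example; they are not taken from the LocalDocs implementation.

```python
from __future__ import annotations


def chunk_text(text: str, chunk_words: int = 64, overlap: int = 16) -> list[str]:
    """Split text into overlapping chunks of at most `chunk_words` words.

    Both defaults are illustrative assumptions, not LocalDocs settings.
    """
    if chunk_words <= 0 or not 0 <= overlap < chunk_words:
        raise ValueError("need chunk_words > 0 and 0 <= overlap < chunk_words")
    words = text.split()
    step = chunk_words - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_words]))
        if start + chunk_words >= len(words):
            break  # this chunk already reaches the end of the text
    return chunks
```

Overlapping windows are one common way to avoid cutting a sentence's context exactly at a chunk boundary; a real indexer might instead split on paragraphs or sentences.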

These document chunks help your LLM respond to queries with knowledge about the contents of your data.
The number of chunks and the size of each chunk can be configured in the LocalDocs plugin settings tab.
To keep indexing fast, LocalDocs uses pre-deep-learning n-gram and TF-IDF based retrieval to decide
which document chunks your LLM should use as context. You'll find its quality comparable
to embedding-based retrieval approaches, while ingesting data orders of magnitude faster.
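
To make the idea of TF-IDF based retrieval concrete, here is a minimal sketch that ranks chunks against a query using only the Python standard library. It is a simplified, unigram-only scoring scheme written for illustration; the tokenizer, the smoothed IDF formula, and the function names are assumptions and do not reflect how LocalDocs scores chunks internally.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rank_chunks(query, chunks, top_k=3):
    """Return up to `top_k` chunks ordered by a simple TF-IDF score."""
    tokenized = [tokenize(chunk) for chunk in chunks]
    n = len(chunks)
    # Document frequency: the number of chunks each term appears in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    query_terms = set(tokenize(query))
    scores = []
    for idx, tokens in enumerate(tokenized):
        tf = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if term in tf:
                idf = math.log((1 + n) / (1 + df[term])) + 1.0  # smoothed IDF
                score += (tf[term] / len(tokens)) * idf
        scores.append((score, idx))
    # Highest score first; ties keep the original chunk order.
    scores.sort(key=lambda pair: (-pair[0], pair[1]))
    return [chunks[idx] for score, idx in scores[:top_k] if score > 0]
```

The chunks returned by a function like `rank_chunks` would then be prepended to the user's question as extra context, which is the retrieval step of Retrieval Augmented Generation.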

LocalDocs supports the following file types:
```json
["txt", "doc", "docx", "pdf", "rtf", "odt", "html", "htm",
    "xls", "xlsx", "csv", "ods", "ppt", "pptx", "odp", "xml", "json", "log", "md", "tex", "asc", "wks",
    "wpd", "wps", "wri", "xhtml", "xht", "xslt", "yaml", "yml", "dtd", "sgml", "tsv", "strings", "resx",
    "plist", "properties", "ini", "config", "bat", "sh", "ps1", "cmd", "awk", "sed", "vbs", "ics", "mht",
    "mhtml", "epub", "djvu", "azw", "azw3", "mobi", "fb2", "prc", "lit", "lrf", "tcr", "pdb", "oxps",
    "xps", "pages", "numbers", "key", "keynote", "abw", "zabw", "123", "wk1", "wk3", "wk4", "wk5", "wq1",
    "wq2", "xlw", "xlr", "dif", "slk", "sylk", "wb1", "wb2", "wb3", "qpw", "wdb", "wks", "wku", "wr1",
    "wrk", "xlk", "xlt", "xltm", "xltx", "xlsm", "xla", "xlam", "xll", "xld", "xlv", "xlw", "xlc", "xlm",
    "xlt", "xln"]
```
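
If you want to check in advance which files in a folder fall under the list above, a small helper like the following can do it. This is a sketch: the extension set is only a subset copied from the list, and the case-insensitive matching and recursive traversal are assumptions made for the example rather than documented LocalDocs behavior.

```python
from pathlib import Path

# A few extensions copied from the list above; extend the set as needed.
SUPPORTED_EXTENSIONS = {"txt", "pdf", "md", "docx", "html", "csv", "json"}


def is_supported(path, extensions=SUPPORTED_EXTENSIONS):
    """Return True if the file's extension (compared in lowercase) is in the set."""
    return Path(path).suffix.lower().lstrip(".") in extensions


def supported_files(folder, extensions=SUPPORTED_EXTENSIONS):
    """Recursively list files under `folder` that have a supported extension."""
    return sorted(
        p for p in Path(folder).rglob("*")
        if p.is_file() and is_supported(p, extensions)
    )
```

For example, `supported_files("path/to/collection")` returns a sorted list of `Path` objects, where the folder path is a placeholder for your own collection.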

#### LocalDocs Limitations
LocalDocs allows your LLM to have context about the contents of your documentation collection.

LocalDocs currently cannot:

- Answer general metadata queries (e.g. `What documents do you know about?`, `Tell me about my documents`)
- Summarize *all* of your documents. It can, however, write a summary informed by the contents of your documents.

#### LocalDocs Roadmap
- Embedding-based semantic search for retrieval.
- Custom model fine-tuned with retrieval in the loop.

## Server Mode

GPT4All Chat comes with a built-in server mode that lets you programmatically interact
with any supported local LLM through a *very familiar* HTTP API modeled on the OpenAI API. You can find the API documentation [here](https://platform.openai.com/docs/api-reference/completions).

Enabling server mode in the chat client will spin up an HTTP server running on `localhost` port
`4891` (the reverse of 1984). You can enable the web server via `GPT4All Chat > Settings > Enable web server`.
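
Since the server speaks plain HTTP, you can also talk to it without any client library. The sketch below builds a completion request using only the Python standard library. The `/completions` path is assumed from the OpenAI API convention linked above, and the helper names and the `max_tokens` default are hypothetical.

```python
import json
import urllib.request

BASE_URL = "http://localhost:4891/v1"  # host and port from the text above


def build_completion_request(prompt, model, max_tokens=50, base_url=BASE_URL):
    """Build (but do not send) a POST request for the completions endpoint."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/completions",  # path assumed from the OpenAI API
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and decode the JSON body; requires a running server."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the web server enabled, `send(build_completion_request("Hello", model_name))` should return the parsed JSON response, where `model_name` is a model available in your chat client.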

Begin using local LLMs in your AI-powered apps by changing a single line of code: the base path for requests.

```python
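# Uses the legacy (pre-1.0) interface of the `openai` Python package.
import openai

# Point the client at the local GPT4All server instead of OpenAI.
openai.api_base = "http://localhost:4891/v1"
# openai.api_base = "https://api.openai.com/v1"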

openai.api_key = "not needed for a local LLM"

# Set up the prompt and other parameters for the API request
prompt = "Who is Michael Jordan?"

# model = "gpt-3.5-turbo"
# model = "mpt-7b-chat"
model = "gpt4all-j-v1.3-groovy"

# Make the API request
response = openai.Completion.create(
    model=model,
    prompt=prompt,
    max_tokens=50,
    temperature=0.28,
    top_p=0.95,
    n=1,
    echo=True,
    stream=False
)

# Print the generated completion
print(response)
```

This gives the following response:

```json
{
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "logprobs": null,
      "text": "Who is Michael Jordan?\nMichael Jordan is a former professional basketball player who played for the Chicago Bulls in the NBA. He was born on December 30, 1963, and retired from playing basketball in 1998."
    }
  ],
  "created": 1684260896,
  "id": "foobarbaz",
  "model": "gpt4all-j-v1.3-groovy",
  "object": "text_completion",
  "usage": {
    "completion_tokens": 35,
    "prompt_tokens": 39,
    "total_tokens": 74
  }
}
```
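
Because the request above sets `echo=True`, the returned `text` begins with the prompt itself. If you only want the generated part, a small helper like the following can strip it. This is a sketch that assumes the response has been parsed into a plain dictionary with the shape shown above; the function name is hypothetical.

```python
def completion_text(response, prompt):
    """Return the first choice's text with the echoed prompt removed, if present."""
    text = response["choices"][0]["text"]
    if text.startswith(prompt):
        text = text[len(prompt):]  # drop the echoed prompt
    return text.lstrip("\n")  # drop the newline that separated prompt and answer
```

Applied to the response above with the prompt `"Who is Michael Jordan?"`, the helper returns only the sentences that follow the echoed question.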