736 Commits

Author SHA1 Message Date
Simón Fishman
68772a70ae add descriptions to fine-tuning dataprep notebook (#673) 2023-09-02 08:26:55 -07:00
Simón Fishman
3119e772b9 clarify title (#672) 2023-09-01 10:14:11 -07:00
Will DePue
6fa65ef615 Update Customizing_embeddings.ipynb to delete long cache output. (#667)
* patch

* remove print statement

---------

Co-authored-by: simonpfish <simonpfish@gmail.com>
2023-08-29 17:50:15 -07:00
Simón Fishman
d0c9afcac0 Revert "File name sanitization (#630)" (#668)
This reverts commit 169f5e02c8.
2023-08-29 17:45:47 -07:00
Stefano Lottini
f3990b8a8f Add Cassandra/Astra DB to the vector databases README (#665)
* Add Cassandra/Astra DB to the vector databases overall README

* Linking to a basic quickstart instead
2023-08-29 15:20:51 -07:00
Liam Thompson
cafe312611 Add elasticsearch examples to vector databases folder (#622)
* Add Elasticsearch to vector databases, add notebooks

* Update prompt

* Make intro verbiage more neutral

* Add semantic search notebook outputs

* Add RAG notebook output

* Update query

* Remove unreadable vector output
2023-08-29 10:54:08 -07:00
Safa Asgar
80a2307be0 File name sanitization (#630)
* File name sanitization

URL containing reserved characters blocks file name creation.

* Regular Expression fix for Sanitized URL

---------

Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-08-29 10:49:23 -07:00
Stefano Lottini
dc5bf03450 Add Astra DB (/Cassandra) to vector databases with example notebooks (#655)
* first commit for the Astra DB / Cassandra notebooks

* add json with quotes

* towards the final cassIO pilot notebook

* small changes to the copy

* cassIO pilot completed and its readme done

* fix silly markdown error

* fix silly markdown error 2

* astra vector link change and vector search picture improved

* added link to docs for connecting to Cassandra cluster

* CQL version of the flow

* revised readme

* final adjustments around metrics/distances

* links' final version assumes files in openai's main branch

* Add ref to QA+vector general guide; fix prompt; clarified conclusion paragraph; typos
2023-08-29 10:27:49 -07:00
Simón Fishman
88d47a1678 update semantic kernel link (#662)
the previous link pointed to a blog with multiple articles. this is a clearer link to point to
2023-08-28 18:32:29 -07:00
recordcrash
d9b4acd1b8 Fix UTF-8 encoding in Chat_finetuning_data_prep.ipynb (#648) 2023-08-28 18:12:30 -07:00
Eliah Kagan
1b3ef07d3e Add Tiktokenizer link in "How to count tokens" (#604)
This adds a link to Tiktokenizer webapp as another tool, in
addition to the OpenAI Tokenizer.
2023-08-28 10:28:19 -07:00
Christine Belzie
a4913d39dd [revise] small edits and fixed typos (#510)
* [revise] made small edits and fixed typos

* fix: revert back to original descriptions

* small language updates

---------

Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-08-25 13:08:49 -07:00
Viet Hoang Tran Duong
7e40a56075 fix undefined variables in fine tune example (#660)
`create_user_message` and `test_df` are not defined.
2023-08-25 13:03:06 -07:00
Simón Fishman
c6c92acb2b update titles (#653) 2023-08-22 16:32:43 -07:00
Simón Fishman
bda60c1008 more fine-tuning improvements (#652)
* more fine-tuning improvements

* add links to other resources
2023-08-22 16:27:08 -07:00
Simón Fishman
3fac4b95d2 small improvements to the fine-tuning cookbook (#651) 2023-08-22 15:30:03 -07:00
Logan Kilpatrick
843ac20701 update readme to link fine-tuning guide (#650) 2023-08-22 14:09:03 -07:00
simonpfish
aa2b5ba709 update fine-tuning cookbook 2023-08-22 13:48:39 -07:00
Michael Wu
d639ac8f27 add ft data prep notebook (#647) 2023-08-22 12:24:42 -07:00
colin-openai
20d802b13b Pushing cookbook for fine-tuning via ChatCompletion (#646)
* Pushing cookbook for fine-tuning via ChatCompletion

* Add correct file

* Fixed bug with create_prompt function and refactored files
2023-08-22 12:24:22 -07:00
aalmaksour82
1100f84e0d Fixed JSON formatting bug in arxiv_functions (#562) 2023-08-18 09:26:04 +01:00
Ikko Eltociear Ashimine
0a857e9c5e Fix typo in redis-hybrid-query-examples.ipynb (#642)
bellow -> below
2023-08-17 03:21:48 -07:00
Shyamal H Anadkat
0d64d2e481 Update recently added (#641)
* Update recently added

* Update README.md

---------

Co-authored-by: Simón Fishman <simonpfish@gmail.com>
2023-08-16 17:59:19 -07:00
Simón Fishman
7719400ef0 fix date typo (#640) 2023-08-16 17:38:48 -07:00
Shyamal H Anadkat
a0f7f529b9 Adds reference to SummEval (#639) 2023-08-16 14:20:22 -07:00
Simón Fishman
f11682c22b various minor improvements (#638) 2023-08-16 10:53:54 -07:00
Shyamal H Anadkat
b8d9b8c8a8 adds notebook on abstractive summarization eval (#637) 2023-08-16 10:25:42 -07:00
prestontuggle
dd1b8980e9 adds one function calling notebook (#629)
* adds one function calling notebook

* reordered imports and ran all cells

* fixed comma error
2023-08-11 20:34:29 -04:00
prestontuggle
0a9a070436 adds two Whisper guides (#628)
* adds two Whisper guides

* polishing

* fixes function argument misordering

---------

Co-authored-by: Ted Sanders <ted@openai.com>
2023-08-11 16:11:19 -04:00
Shantanu Nair
4619f366e8 Fix function description (#626) 2023-08-09 10:34:59 -07:00
Ted Sanders
8e2c7b44c4 Add prompttools to prompting libraries (#620) 2023-08-01 22:51:45 -07:00
Anton Troynikov
ab3e8524aa Update notebooks (#598) 2023-07-24 15:59:33 -07:00
richzw
39f1776c1a add tiktoken-go to How_to_count_tokens_with_tiktoken.ipynb (#605) 2023-07-24 15:54:52 -07:00
Krista Pratico
f7266e7e4f [azure] add functions notebook sample (#595)
* add azure functions notebook sample

* update api key to use env var + note use of env vars over config in code across azure samples
2023-07-21 16:38:49 -07:00
ancri
35b88b80ac consolidate Embedding.create calls into one (#543) 2023-07-20 20:20:04 -07:00
Tomas Dulka
8f24e11b2a replace eval with safer literal_eval (#561) 2023-07-17 16:40:54 -07:00
Sebastian Witalec
ac094dd124 fix markdown typo on a link (#591) 2023-07-17 15:53:44 -07:00
Alex Dhillon
2351ec31bf simplify pretty print (#575) 2023-07-16 14:22:46 -07:00
Yoav Farhi
9db10b5805 Update the actual number of results used in generating the final answer (#587)
The comment mentions 20, but it actually uses 5
2023-07-13 10:47:49 -07:00
Moiz Sajid
04a203089c Updated README.md in examples/vector_databases (#581)
Added the links to the missing vector databases
2023-07-12 12:28:45 -07:00
Ikko Eltociear Ashimine
2a1b6c30a8 Fix typo in QA_with_Langchain_AnalyticDB_and_OpenAI.ipynb (#582)
futher -> further
2023-07-12 12:28:14 -07:00
Jason M
aadfcd040d Update How_to_call_functions_with_chat_models.ipynb (#545)
Updates "min" parameter to "multiplier". Should run the same.
2023-07-11 17:38:33 -07:00
liuchengshan-lcs
a667fce73f Add getting started with PolarDB vector database and OpenAI example. (#489) 2023-07-11 17:13:26 -07:00
Douglas Blank
d6acc8894f Added notebook example for visualizing embeddings in Kangas (#469)
* Added notebook example for visualizing embeddings in Kangas

Plots UMAP projection space, one per row, in open source Kangas DataGrid.

For more information about Kangas, see: https://github.com/comet-ml/kangas

* Moved notebook to third_party_examples
2023-07-11 17:11:16 -07:00
Eli
015a401dcf Enhancements and Refactoring of Python Code Extraction Methods (#467)
* Refactor and enhance code extraction methods.

* Use f-strings to print filepaths, improving readability.
2023-07-11 17:08:37 -07:00
Ted Sanders
50ae26c0e4 fixes token counting in translate_latex_book.ipynb (#579)
* fixes token counting in translate_latex_book.ipynb

* adds back comment
2023-07-11 17:00:38 -07:00
Ted Sanders
86c04efb3b Update README.md with fixed vector DB link (#576) 2023-07-10 12:31:35 -07:00
sun zhun
71b33d8a71 Update How_to_handle_rate_limits.ipynb (#554)
Replace deprecated model "code-cushman-001" with "gpt-3.5-turbo".
2023-06-29 22:23:39 -07:00
Kacper Łukawski
645ec14ea2 Refactor Qdrant notebooks (#556)
* Upgrade Qdrant to 1.3.0

* Adapt the descriptions and run the missing cells
2023-06-29 07:47:18 -07:00
shayarnett
f45d5f38a2 Update broken link in README.md (#558)
Reranking search with cross-encoders link was incorrect.
2023-06-29 07:46:24 -07:00