mirror of
https://github.com/openai/openai-cookbook
synced 2024-11-04 06:00:33 +00:00
Added a new notebook: "Parse PDF docs for RAG applications" (#1080)
Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: prestontuggle <97747561+prestontuggle@users.noreply.github.com> Co-authored-by: Shyamal H Anadkat <shyamal@openai.com> Co-authored-by: Simón Fishman <simonpfish@gmail.com> Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com> Co-authored-by: aalmaksour82 <49364099+aalmaksour82@users.noreply.github.com> Co-authored-by: colin-openai <119888926+colin-openai@users.noreply.github.com> Co-authored-by: Michael Wu <mwu1993@users.noreply.github.com> Co-authored-by: Logan Kilpatrick <logan@openai.com> Co-authored-by: Viet Hoang Tran Duong <36019296+viethoangtranduong@users.noreply.github.com> Co-authored-by: Christine Belzie <105683440+CBID2@users.noreply.github.com> Co-authored-by: Eliah Kagan <degeneracypressure@gmail.com> Co-authored-by: recordcrash <recordcrash@users.noreply.github.com> Co-authored-by: Stefano Lottini <hemidactylus@users.noreply.github.com> Co-authored-by: Safa Asgar <70315479+SaFaUU@users.noreply.github.com> Co-authored-by: Liam Thompson <32779855+leemthompo@users.noreply.github.com> Co-authored-by: Will DePue <will@depue.net> Co-authored-by: ys64 <815824+ys64@users.noreply.github.com> Co-authored-by: Shawn Yuxuan Tong <tongyuxuan361@gmail.com> Co-authored-by: Steven Pousty <steve.pousty@gmail.com> Co-authored-by: Puneet Dhiman <142409038+PuneetDhimanShorthillsAI@users.noreply.github.com> Co-authored-by: Krista Pratico <krpratic@microsoft.com> Co-authored-by: dongqqcom <32085836+dongqqcom@users.noreply.github.com> Co-authored-by: Alvaro Videla <videlalvaro@gmail.com> Co-authored-by: DevilsWorkShop <ashokmanghat@gmail.com> Co-authored-by: Ashok Manghat <amanghat@rmplc.net> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthew Jericho Go Sy <69558553+jerichosy@users.noreply.github.com> Co-authored-by: Farzad Sunavala <40604067+farzad528@users.noreply.github.com> Co-authored-by: Nirant <NirantK@users.noreply.github.com> Co-authored-by: Cathy Chen <cathykaichen@gmail.com> Co-authored-by: gusmally <hannahmbmoraes@gmail.com> Co-authored-by: Chuong Ho <31106432+chuongmep@users.noreply.github.com> Co-authored-by: ridrisa <138629783+ridrisa@users.noreply.github.com> Co-authored-by: Xin(Leo) Jing <jingxin@berkeley.edu> Co-authored-by: Per Harald Borgen <perhborgen@gmail.com> Co-authored-by: Hoang Viet Khoa <khoahv92@gmail.com> Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com> Co-authored-by: Albarqawi <barqawi.88@outlook.com> Co-authored-by: Saarika Bhasi <55930906+saarikabhasi@users.noreply.github.com> Co-authored-by: Daniel <10074684+danieltprice@users.noreply.github.com> Co-authored-by: Dhruv Anand <105786647+dhruv-anand-aintech@users.noreply.github.com> Co-authored-by: Jiří Hofman <jiri.hofman@gmail.com> Co-authored-by: Fayaz Rahman <fayazrahman4u@gmail.com> Co-authored-by: Anish Shah <93145909+ash0ts@users.noreply.github.com> Co-authored-by: Krish Dholakia <krrishdholakia@gmail.com> Co-authored-by: Emil Sedgh <emilsedgh@kde.org> Co-authored-by: Megan O'Keefe <3137106+askmeegs@users.noreply.github.com> Co-authored-by: Joschka Braun <47435119+joschkabraun@users.noreply.github.com> Co-authored-by: Roger Zurawicki <zurawiki@users.noreply.github.com> Co-authored-by: pavlovp <pavel.pavlov1990@gmail.com> Co-authored-by: Surav Shrestha <98219089+suravshresth@users.noreply.github.com> Co-authored-by: vrushankportkey <134934501+vrushankportkey@users.noreply.github.com> Co-authored-by: Soonoh <chk0ndanger@gmail.com> Co-authored-by: Mayuresh Dharwadkar <98738585+Mayureshd-18@users.noreply.github.com> Co-authored-by: Yashwant Jodha <76436993+yashwantjodha@users.noreply.github.com> Co-authored-by: Guspan Tanadi <36249910+guspan-tanadi@users.noreply.github.com> Co-authored-by: Ana Martins <60753223+OutSystemsAMM@users.noreply.github.com> Co-authored-by: Greg Richardson <greg.nmr@gmail.com> Co-authored-by: john <johnoctubre7@gmail.com> Co-authored-by: John Octubre <johnoctubre@Johns-MacBook-Pro.local> Co-authored-by: jhills20 <70035505+jhills20@users.noreply.github.com> Co-authored-by: Tad <wptady@gmail.com> Co-authored-by: Ilan Bigio <ilanbigio@gmail.com> Co-authored-by: Ilan Bigio <ilan@openai.com> Co-authored-by: royziv11 <103690170+royziv11@users.noreply.github.com> Co-authored-by: Gabor Cselle <gaborcselle@users.noreply.github.com> Co-authored-by: D. Carpintero <6709785+dcarpintero@users.noreply.github.com> Co-authored-by: Ed Spencer <ed@edspencer.net> Co-authored-by: Ravi Theja <ravi03071991@gmail.com> Co-authored-by: dylanra-openai <149511600+dylanra-openai@users.noreply.github.com> Co-authored-by: Taranjeet Singh <reachtotj@gmail.com> Co-authored-by: Frode Jensen <jensen.frode@gmail.com> Co-authored-by: Lionel Cheng <60159831+lionelchg@users.noreply.github.com> Co-authored-by: lionelchg <Cheng.Lionel@bcg.com> Co-authored-by: Jing Ai <42414856+jingairpi@users.noreply.github.com> Co-authored-by: Jing Ai <jingai@jings-air-2020.lan> Co-authored-by: Spring_MT <today.is.sky.blue.sky@gmail.com> Co-authored-by: kevleininger <kevleininger@gmail.com> Co-authored-by: Prakul <discover.prakul@gmail.com> Co-authored-by: Logan Kilpatrick <23kilpatrick23@gmail.com> Co-authored-by: Jiang Yucheng <fatjyc@gmail.com> Co-authored-by: Haomin Liu <644074553@qq.com> Co-authored-by: Xavier Amatriain <xavier.amatriain@gmail.com> Co-authored-by: Caio Curitiba Marcellos <caiocuritiba@gmail.com> Co-authored-by: Kesku <62210496+kesku@users.noreply.github.com> Co-authored-by: markbigears <86395716+markbigears@users.noreply.github.com> Co-authored-by: bigears <mark.forsyth@yourbigears.com> Co-authored-by: Nghiauet <63385521+Nghiauet@users.noreply.github.com> Co-authored-by: Vince Fulco--Bighire.tools <vince@bighire.io> Co-authored-by: Wang22004K <152562528+Wang22004K@users.noreply.github.com> Co-authored-by: Shaurya Rohatgi <shauryr@gmail.com> Co-authored-by: Dhruv Singh <ds3638@columbia.edu> Co-authored-by: Adam Hendel <ChuckHend@users.noreply.github.com> Co-authored-by: Enoch Cheung <enoch@enochc.com> Co-authored-by: Zanie Blue <contact@zanie.dev> Co-authored-by: rissois <44072214+rissois@users.noreply.github.com> Co-authored-by: ayush rajgor <ayushrajgorar@gmail.com> Co-authored-by: teomusatoiu <156829031+teomusatoiu@users.noreply.github.com> Co-authored-by: James Briggs <35938317+jamescalam@users.noreply.github.com> Co-authored-by: Shivam Rastogi <shivamsupr@gmail.com> Co-authored-by: Alex Yang <himself65@outlook.com> Co-authored-by: Elmira Ghorbani <elmira.ghorbani96@gmail.com> Co-authored-by: gloryjain <glory@openai.com> Co-authored-by: Andrew Peng <apeng@berkeley.edu>
This commit is contained in:
parent
88051d4185
commit
e92df85ad4
@ -72,3 +72,8 @@ teomusatoiu:
|
||||
name: "Teodora Musatoiu"
|
||||
website: "https://www.linkedin.com/in/teodora-musatoiu/"
|
||||
avatar: "https://avatars.githubusercontent.com/u/156829031?s=400&u=af40fe04d9255139eb3bbf8dc83422cc694e862b&v=4"
|
||||
|
||||
katiagg:
|
||||
name: "Katia Gil Guzman"
|
||||
website: "https://katia.gg"
|
||||
avatar: "https://avatars.githubusercontent.com/u/16519462?v=4"
|
2226
examples/Parse_PDF_docs_for_RAG.ipynb
Normal file
2226
examples/Parse_PDF_docs_for_RAG.ipynb
Normal file
File diff suppressed because one or more lines are too long
BIN
examples/data/example_pdfs/evals-decks.pdf
Normal file
BIN
examples/data/example_pdfs/evals-decks.pdf
Normal file
Binary file not shown.
BIN
examples/data/example_pdfs/fine-tuning-deck.pdf
Normal file
BIN
examples/data/example_pdfs/fine-tuning-deck.pdf
Normal file
Binary file not shown.
BIN
examples/data/example_pdfs/models-page.pdf
Normal file
BIN
examples/data/example_pdfs/models-page.pdf
Normal file
Binary file not shown.
BIN
examples/data/example_pdfs/rag-deck.pdf
Normal file
BIN
examples/data/example_pdfs/rag-deck.pdf
Normal file
Binary file not shown.
1
examples/data/parsed_pdf_docs.json
Normal file
1
examples/data/parsed_pdf_docs.json
Normal file
File diff suppressed because one or more lines are too long
@ -1166,7 +1166,7 @@
|
||||
path: examples/RAG_with_graph_db.ipynb
|
||||
date: 2023-12-08
|
||||
authors:
|
||||
- katia-openai
|
||||
- katiagg
|
||||
tags:
|
||||
- embeddings
|
||||
- completions
|
||||
@ -1203,3 +1203,12 @@
|
||||
tags:
|
||||
- vision
|
||||
- embeddings
|
||||
|
||||
- title: How to parse PDF docs for RAG
|
||||
path: examples/Parse_PDF_docs_for_RAG.ipynb
|
||||
date: 2024-02-28
|
||||
authors:
|
||||
- katiagg
|
||||
tags:
|
||||
- vision
|
||||
- embeddings
|
Loading…
Reference in New Issue
Block a user