langchain/tests/integration_tests/document_loaders
Jon Saginaw f8d69e4e52
Enhancement: Blockchain Document Loader with better Metadata support (#3710)
This PR includes some minor alignment updates, including:

- metadata object extended to support contractAddress, blockchainType,
and tokenId
- notebook doc better aligned to standard langchain format
- startToken changed from int to str to support multiple hex value types
on the Alchemy API

The updated metadata will look like the below. It's possible for a
single contractAddress to exist across multiple blockchains (e.g.
Ethereum, Polygon, etc.) so it's important to include the
blockchainType.

```
 metadata = {"source": self.contract_address, 
                      "blockchain": self.blockchainType,
                      "tokenId": tokenId}
```
2023-04-28 20:13:05 -07:00
..
__init__.py
test_arxiv.py Arxiv document loader (#3627) 2023-04-26 21:04:56 -07:00
test_bigquery.py
test_bilibili.py Added bilibili loader (#2673) (#2724) 2023-04-11 10:40:32 -07:00
test_blockchain.py Enhancement: Blockchain Document Loader with better Metadata support (#3710) 2023-04-28 20:13:05 -07:00
test_bshtml.py Add get_text_separator parameter to BSHTMLLoader (#3551) 2023-04-26 16:10:16 -07:00
test_confluence.py Several confluence loader improvements (#3300) 2023-04-23 15:06:10 -07:00
test_dataframe.py rm pandas dependency (#2102) 2023-03-28 08:38:19 -07:00
test_duckdb.py
test_email.py Harrison/msg files (#2375) 2023-04-04 06:48:34 -07:00
test_figma.py
test_gitbook.py Harrison/gitbook (#2044) 2023-03-28 15:28:33 -07:00
test_ifixit.py
test_pdf.py Add Mathpix pdf loader (#3727) 2023-04-28 20:11:22 -07:00
test_python.py Add PythonLoader which auto-detects encoding of Python files (#3311) 2023-04-21 10:47:57 -07:00
test_sitemap.py
test_slack.py Add Slack Directory Loader (#2841) 2023-04-13 21:31:59 -07:00
test_stripe.py Harrison/stripe (#3762) 2023-04-28 20:03:21 -07:00
test_url_playwright.py Harrison/playwright selector (#3185) 2023-04-19 16:54:15 -07:00
test_url.py add continue to fix 'continue_on_failure' parameter for URL doc loader (#2735) 2023-04-11 21:12:39 -07:00