You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
langchain/docs
Martin Schade 0c7f1d8b21
Textract linearizer (#12446)
**Description:** Textract PDF Loader generating linearized output,
meaning it will replicate the structure of the source document as close
as possible based on the features passed into the call (e. g. LAYOUT,
FORMS, TABLES). With LAYOUT reading order for multi-column documents or
identification of lists and figures is supported and with TABLES it will
generate the table structure as well. FORMS will indicate "key: value"
with columms.
  - **Issue:** the issue fixes #12068 
- **Dependencies:** amazon-textract-textractor is added, which provides
the linearization
  - **Tag maintainer:** @3coins 

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
11 months ago
..
api_reference Merge pull request #12433 11 months ago
docs Textract linearizer (#12446) 11 months ago
docs_skeleton/docs/guides/langsmith mv old integration docs (#12217) 11 months ago
extras/guides/langsmith Bagatur/mv singlestore doc (#12053) 11 months ago
scripts notebook fmt (#12498) 11 months ago
src add cookbook table (#12043) 11 months ago
static Docs: QA Privacy Nit (#12025) 11 months ago
.local_build.sh langserve doc (#12357) 11 months ago
README.md Fix typos (#11663) 11 months ago
babel.config.js Restructure docs (#11620) 11 months ago
code-block-loader.js Restructure docs (#11620) 11 months ago
docusaurus.config.js Add dev guide to docs(#12291) 11 months ago
package-lock.json Bump @babel/traverse from 7.22.8 to 7.23.2 in /docs (#12453) 11 months ago
package.json Restructure docs (#11620) 11 months ago
settings.ini Restructure docs (#11620) 11 months ago
sidebars.js Docs: consolidate top nav (#12219) 11 months ago
vercel.json docs: Google Cloud Documentation Cleanup (#12224) 11 months ago
vercel_build.sh langserve doc (#12357) 11 months ago
vercel_requirements.txt Add api cross ref linking (#8275) 1 year ago

README.md

Website

This website is built using Docusaurus 2, a modern static website generator.

Installation

$ yarn

Local Development

$ yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

$ yarn build

This command generates static content into the build directory and can be served using any static contents hosting service.

Deployment

Using SSH:

$ USE_SSH=true yarn deploy

Not using SSH:

$ GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.

Continuous Integration

Some common defaults for linting/formatting have been set for you. If you integrate your project with an open-source Continuous Integration system (e.g. Travis CI, CircleCI), you may check for issues using the following command.

$ yarn ci