mirror of https://github.com/hwchase17/langchain
adding language as parameter to NLTK text splitter (#10229)
- Description: Adds language as a parameter to the NLTK text splitter, which previously always used English. This makes the NLTK splitter usable for other languages. The change is simple: NLTKTextSplitter accepts a language parameter and passes it to NLTK's "sent_tokenize".
- Issue: N/A
- Dependencies: N/A

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>

pull/10291/head
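
A minimal sketch of the idea in the description, not the actual diff: a splitter stores a language argument (defaulting to English) and forwards it to the sentence tokenizer. The class name SketchNLTKTextSplitter, the split_sentences method, and the injectable tokenizer argument are hypothetical and exist only for illustration; the only real API assumed is nltk.tokenize.sent_tokenize(text, language=...).

```python
from typing import Callable, List, Optional

# Any callable taking (text, language); mirrors how sent_tokenize is called below.
SentTokenizer = Callable[[str, str], List[str]]


def _nltk_sent_tokenize(text: str, language: str) -> List[str]:
    # Imported lazily so this sketch can be loaded without NLTK installed.
    from nltk.tokenize import sent_tokenize

    return sent_tokenize(text, language=language)


class SketchNLTKTextSplitter:
    """Hypothetical stand-in for illustration; not the LangChain class."""

    def __init__(
        self,
        language: str = "english",  # default keeps the English-only behaviour
        tokenizer: Optional[SentTokenizer] = None,  # assumption: injectable for testing
    ) -> None:
        self._language = language
        self._tokenizer = tokenizer or _nltk_sent_tokenize

    def split_sentences(self, text: str) -> List[str]:
        # Forward the stored language instead of relying on the tokenizer's default.
        return self._tokenizer(text, self._language)
```

With NLTK and its sentence tokenizer data installed, SketchNLTKTextSplitter(language="german").split_sentences(text) would tokenize with the German model; omitting the argument keeps the English default.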
parent b3a8fc7cb1
commit ddd07001f3