langchain/tests/unit_tests
Sasmitha Manathunga 0c7a5cb206
Fix inconsistent behavior of CharacterTextSplitter when changing keep_separator (#7263)
- Description:
- When `keep_separator` is `True` the `_split_text_with_regex()` method
in `text_splitter` uses regex to split, but when `keep_separator` is
`False` it uses `str.split()`. This causes problems when the separator
is a special regex character like `.` or `*`. This PR fixes that by
using `re.split()` in both cases.
- Issue: #7262 
- Tag maintainer: @baskaryan
2023-07-06 09:30:03 -04:00
..
agents Add OpenAIMultiFunctionsAgent to import list in agents directory (#6824) 2023-06-29 18:34:26 -07:00
callbacks Add serialized object to retriever start callback (#7074) 2023-07-05 18:04:43 +01:00
chains Docs combine document chain (#6994) 2023-07-04 12:51:04 -06:00
chat_models Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
client Add Input Mapper in run_on_dataset (#6894) 2023-06-29 16:53:49 -07:00
data
docstore Enable InMemoryDocstore to be constructed without providing a dict (#6976) 2023-07-05 16:56:31 -04:00
document_loaders Document loader for Cube Semantic Layer (#6882) 2023-07-05 15:18:12 -07:00
evaluation Simplify eval arg names (#6944) 2023-06-30 07:47:53 -07:00
examples Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
llms Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
load Include placeholder value for all secrets, not just kwargs (#6421) 2023-06-19 15:41:45 +01:00
memory Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
output_parsers Update String Evaluator (#6615) 2023-06-26 14:16:14 -07:00
prompts Jinja2 validation changed to issue warnings rather than issuing exceptions. (#7161) 2023-07-05 14:04:29 -04:00
retrievers Add serialized object to retriever start callback (#7074) 2023-07-05 18:04:43 +01:00
tools Fixed Office365 tool __init__.py files, tests, and get_tools() function (#7046) 2023-07-05 15:46:21 -04:00
utilities Fix graphql tool (#4984) 2023-05-19 15:27:50 -07:00
vectorstores Add maximal relevance search to SKLearnVectorStore (#5430) 2023-05-30 16:13:33 -07:00
__init__.py
conftest.py
test_bash.py Add Mastodon toots loader (#5036) 2023-05-22 16:43:07 -07:00
test_cache.py Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
test_dependencies.py Docs combine document chain (#6994) 2023-07-04 12:51:04 -06:00
test_document_transformers.py
test_formatting.py
test_math_utils.py add get_top_k_cosine_similarity method to get max top k score and index (#5059) 2023-05-22 11:55:48 -07:00
test_pytest_config.py
test_python.py
test_schema.py Harrison/split schema dir (#7025) 2023-07-01 13:39:19 -04:00
test_sql_database_schema.py
test_sql_database.py Fix SQLAlchemy truncating text when it is too big (#5206) 2023-06-01 21:33:31 -04:00
test_text_splitter.py Fix inconsistent behavior of CharacterTextSplitter when changing keep_separator (#7263) 2023-07-06 09:30:03 -04:00