mirror of
https://github.com/hwchase17/langchain
synced 2024-10-29 17:07:25 +00:00
7688bf9182
I was testing out the WhatsApp Document loader, and noticed that sometimes the date is of the following format (notice the additional underscore):

```
3/24/23, 1:54_PM - +91 99999 99999 joined using this group's invite link
3/24/23, 6:29_PM - +91 99999 99999: When are we starting then?
```

Weirdly, the underscore is visible in Vim, but not in editors like VSCode. I presume it is some unusual character/line terminator. Nevertheless, I think handling this edge case will make the document loader more robust.
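
A minimal sketch of how a timestamp like the one above could be matched tolerantly. This is not the loader's actual regex: `LINE_RE` and `parse_line` are hypothetical names used only for illustration. The exact character behind the "underscore" is not identified here; the sketch simply assumes it is a single non-alphanumeric character (for example a literal underscore or a narrow no-break space, U+202F) sitting between the time and the AM/PM marker.

```python
import re

# Hypothetical pattern for illustration; not taken from the loader's source.
LINE_RE = re.compile(
    r"^(\d{1,2}/\d{1,2}/\d{2}), "  # date, e.g. 3/24/23
    r"(\d{1,2}:\d{2})"             # time, e.g. 1:54
    r"[\W_]?"                      # assumed: at most one odd separator character
    r"([AP]M)"                     # meridiem marker
    r" - (.*)$",                   # remainder: sender and message text
    re.IGNORECASE,
)


def parse_line(line):
    """Return (date, normalized time, remainder), or None if the line does not match."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    date, time, meridiem, rest = match.groups()
    # Normalize the separator to a plain space so later parsing sees one format.
    return date, f"{time} {meridiem.upper()}", rest


if __name__ == "__main__":
    print(parse_line("3/24/23, 6:29_PM - +91 99999 99999: When are we starting then?"))
```

Accepting any single non-word character, rather than hard-coding one code point, keeps the sketch working whichever character the export actually contains, at the cost of being slightly more permissive than strictly necessary.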
conllu.conllu
facebook_chat.json
fake-content.html
fake-email.eml
fake-email.msg
fake-power-point.pptx
fake.docx
layout-parser-paper.pdf
mlb_teams_2012.csv
notebook.ipynb
telegram.json
testing.enex
whatsapp_chat.txt