mirror of https://github.com/hwchase17/langchain
WhatsApp document loader - update regex (#2776)
I was testing out the WhatsApp document loader and noticed that the date is sometimes in the following format (notice the additional underscore):

```
3/24/23, 1:54_PM - +91 99999 99999 joined using this group's invite link
3/24/23, 6:29_PM - +91 99999 99999: When are we starting then?
```

Weirdly, the underscore is visible in Vim but not in editors like VSCode. I presume it is some unusual character or line terminator. Nevertheless, I think handling this edge case will make the document loader more robust.
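To illustrate the kind of regex change described here, a minimal sketch follows. It is not the loader's actual pattern: `MESSAGE_LINE` and `parse_line` are hypothetical names, and the guess that the stray character is a non-standard space (for example U+202F, narrow no-break space) is an assumption, so the pattern also accepts a literal underscore.

```python
import re

# Hypothetical pattern for illustration only; not the regex used by the loader.
# [\s_]? tolerates an ordinary space, a literal underscore, or any Unicode
# whitespace (Python's \s covers U+202F and U+00A0) between the time and AM/PM.
MESSAGE_LINE = re.compile(
    r"^(\d{1,2}/\d{1,2}/\d{2,4}), "        # date, e.g. 3/24/23
    r"(\d{1,2}:\d{2})[\s_]?([AaPp][Mm])"   # time, optional separator, AM/PM
    r" - ([^:]+): (.+)$"                   # sender, then message text
)


def parse_line(line):
    """Return (date, time, meridiem, sender, text), or None if the line
    is not a chat message (for example a 'joined' system notice)."""
    match = MESSAGE_LINE.match(line.strip())
    return match.groups() if match else None


if __name__ == "__main__":
    sample = "3/24/23, 6:29\u202fPM - +91 99999 99999: When are we starting then?"
    print(parse_line(sample))
```

Making the separator optional and broad keeps ordinary exports parsing as before while accepting the variant shown above. System notices without a `sender: text` part simply return `None`.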
parent
2db9b7a45d
commit
7688bf9182