By default the following upload file formats are supported:
- Text
- Portable
- Microsoft Office
- Open Document
- Ebook
- JSON
- Comma-separated values (CSV(1))
- Custom
(1) - Notice the separator must be "," and not ";".
Take into account that simple files like .csv, .txt are expected to have utf-8 encoding.
Warning: All preprocessing is text related, images included in documents are not considered.
- .custom: use it to manually configure your desired chunks and metadata.
- .web: use it to Craw a web site.