3a56a06c4f
Make some temporary modifications to the CrawledDocument model to support both a "big string" style headers field like in the old formats, and explicit fields as in the new formats. This is a bit awkward to deal with, but it's a necessity until we migrate off the old formats entirely. The commit also adds a few tests to this logic. |
||
---|---|---|
.. | ||
crawl-spec | ||
crawling-model | ||
processed-data | ||
work-log |