5329968155
This commit adds tests to CrawlingThenConvertingIntegrationTest for invalid, redirecting, and blocked domains. It also improves the filtering of irrelevant entries in ParquetSerializableCrawlDataStream.

| Path |
|---|
| main/java |
| test/java/nu/marginalia/crawling/parquet |