A scalable, mature and versatile web crawler based on Apache Storm
Stars
973
Forks
273
Watchers
973
Open Issues
26
Overall repository health assessment
No package.json found
This might not be a Node.js project
1.7k
commits
279
commits
143
commits
47
commits
33
commits
31
commits
17
commits
15
commits
13
commits
11
commits
Add cleanup() to URLFilter and fix timer/client leak in JSONURLFilterWrapper (#1867)
cdac7e4View on GitHubFix timer and client leak in OpenSearch JSONResourceWrapper (#1866)
29d3c26View on GitHubFix client leak on BulkProcessor or Sniffer construction failure (#1865)
5e5fc3bView on GitHubFix double-close race on static client in OpenSearch AbstractSpout (#1864)
45a9755View on GitHubFix thread-safety and minor issues in OkHttp protocol (#1862)
c523032View on GitHubFix bugs and resource leaks in Apache HttpClient protocol implementation (#1863)
d485783View on GitHubAdd lowercaseElementNames unit test and make method public (#1860)
df7dfe3View on GitHubAdd XPath JSoup filter tests for XSoup backward compatibility (#1857)
7f10122View on GitHubMinor: Regenerated License File for 25684d47033844cabc3dbeab9b56ef7af0b06b04 (#1858)
753d68dView on GitHubReplace XSoup with JSoup built-in XPath support (#1856)
25684d4View on GitHubFix race condition while having different proxies in different threads #1247 (#1855)
a08a456View on GitHubremove maximumsize for cache in abstractupdaterBolt (#1854)
55d744fView on GitHubMinor: Regenerated License File for 52bd4017863fd3f49f77511e0be5650983544d54 (#1853)
1d422b5View on GitHub