GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

Copyright (c) 2026 Alexey Ratnikov

apify/crawlee-python - GitHub Explorer | GitHub Explorer | Trending | Compare

crawlee-python

apify•PUBLIC

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

apifyautomationbeautifulsoupcrawlercrawlinghacktoberfest

Apache License 2.0

Created on Jan 10, 2024

Updated on Apr 7, 2026

Stars

8.7k

Forks

703

Watchers

8.7k

Open Issues

82

Repository Health Score

💛

81/100

Good

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

headless

headless-chrome

parsel

pip

playwright

python

scraper

scraping

web-crawler

web-crawling

web-scraping

Community

8,733 stars, 703 forks

16/30

53%

Documentation

Has description, license

15/20

75%

Maintenance

0.9% issue ratio

20/20

100%

Health score is calculated based on activity, community engagement, documentation quality, and maintenance practices

Languages

Python

75.9%

MDX

17.7%

JavaScript

4.0%

CSS

2.0%

Lua

0.2%

Dockerfile

0.2%

Shell

0.0%

Issues Analytics

23

Total Issues

All time

11

Open

48% of total

12

Closed

52% of total

7d

Avg Close Time

Fast response ✅

Issues Activity: Last 6 months

Top Labels

Hottest Issues

1

#1802 `RequestList` with persistence doesn't handle malformed URLs correctly

t-tooling

7

closed

2

#1744 Redesign browser pool and the whole browsers subpackage

enhancementt-tooling

3

3

#1831 SqlStorageClient silently swallows write errors, causing data loss

bugt-tooling

3

open

4

#1784 PlaywrightCrawler __init__ method browser_new_context_options argument does not function

bugt-tooling

3

closed

5

#1756 Use new docker images in crawlee cli templates

t-tooling

1

2

closed

1

open

Dependencies

No package.json found

This might not be a Node.js project

Top Contributors

1

vdusek

User

305

commits

2

renovate[bot]

Bot

282

commits

3

Mantisus

User

174

commits

4

janbuchar

User

120

commits

5

Pijukatel

User

105

commits

6

B4nan

User

49

commits

7

github-actions[bot]

github-actions[bot]

Bot

39

commits

8

barjin

User

24

commits

9

souravjain540

User

7

commits

10

webrdaniel

User

3

commits

Recent Commits

chore(release): Update changelog and package version [skip ci]

github-actions[bot]•1 hour ago

cf6737cView on GitHub

fix: Apply SQLite optimizations to the custom `connection_string` in `SqlStorageClient` (#1837)

Max Bohomolov•1 hour ago

8b53e27View on GitHub

chore: Use only packages older than 24 hours (#1822)

Josef Procházka•5 days ago

31509e0View on GitHub

test: Fix flaky event manager tests by replacing sleep with wait (#1830)

Vlada Dusek•5 days ago

2c691d0View on GitHub

chore(release): Update changelog and package version [skip ci]

github-actions[bot]•5 days ago

d5715f3View on GitHub

fix: Prevent premature `EventManager` shutdown when multiple crawlers share it (#1810)

Max Bohomolov•5 days ago

2efb668View on GitHub

docs: Fix broken versioning of changelog (#1829)

Vlada Dusek•6 days ago

cb483b2View on GitHub

docs: port docs fixes from apify-docs (#1828)

Martin Adámek•1 week ago

d220bcfView on GitHub

docs: Fix version switching for API reference pages (#1823)

Vlada Dusek•1 week ago

f6be00bView on GitHub

test: Increase sleep tolerance in request_max_duration test for Windows CI (#1827)

Vlada Dusek•1 week ago

e2d7069View on GitHub

chore(release): Update changelog and package version [skip ci]

github-actions[bot]•1 week ago

41c21b7View on GitHub

fix(file-system): Reclaim orphaned in-progress requests on RQ recovery (#1825)

Vlada Dusek•1 week ago

e86794aView on GitHub

docs: add CloakBrowser stealth browser example (#1794)

Cloak-HQ•1 week ago

e1b4346View on GitHub

chore(deps): update rhysd/actionlint action to v1.7.12 (#1826)

renovate[bot]•1 week ago

a3b635eView on GitHub

chore(release): Update changelog and package version [skip ci]

github-actions[bot]•1 week ago

286fa3aView on GitHub

View all commits