In PyCharm, I used the Scrapy framework to build a distributed Python crawler. The design incorporates a proxy IP pool and server camouflage (rotating User-Agent headers), and uses Redis to improve the efficiency of data acquisition. I tested the crawler by scraping commodity information from Taobao, then wrote the scraped data asynchronously into a MySQL database using the Twisted framework. In IDLE, I used the pandas module to read the stored data from the database, together with numpy and matplotlib for data analysis and visualization. Finally, machine-learning modules such as sklearn, Gensim, and Keras were used for data classification, data clustering, and correlation analysis.
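The proxy pool and server camouflage can be sketched as a Scrapy-style downloader middleware. This is a minimal illustration, not the project's actual middleware: the proxy addresses and User-Agent strings below are hypothetical placeholders, and in the real design the proxies would be drawn from the Redis-backed pool rather than a hard-coded list.

```python
import random

# Hypothetical proxy addresses; the project draws these from a Redis-backed pool.
PROXY_POOL = [
    "http://10.0.0.1:8080",
    "http://10.0.0.2:8080",
]

# Rotating User-Agent strings implements the "server camouflage" described above.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64)",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7)",
]


class RandomProxyMiddleware:
    """Scrapy-style downloader middleware: attach a random proxy and User-Agent."""

    def process_request(self, request, spider):
        request.meta["proxy"] = random.choice(PROXY_POOL)
        request.headers["User-Agent"] = random.choice(USER_AGENTS)
        return None  # returning None lets Scrapy continue handling the request
```

Enabled via `DOWNLOADER_MIDDLEWARES` in `settings.py`, every outgoing request then carries a randomized proxy and User-Agent, which makes the crawler harder for the target site to fingerprint.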
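The asynchronous write stage follows the pattern below. The project itself uses Twisted's `adbapi.ConnectionPool` to dispatch MySQL inserts off the reactor thread; as a runnable stand-in, this sketch uses a thread pool and an in-memory sqlite3 database, with the table name and columns invented for illustration.

```python
import sqlite3
from concurrent.futures import ThreadPoolExecutor

# Stand-in for the MySQL database written to via Twisted's adbapi in the project.
conn = sqlite3.connect(":memory:", check_same_thread=False)
conn.execute("CREATE TABLE commodity (title TEXT, price REAL)")

# One worker thread keeps access to the single sqlite3 connection serialized.
executor = ThreadPoolExecutor(max_workers=1)


def insert_item(item):
    # Runs off the main thread, like a query dispatched via adbapi.runInteraction.
    with conn:
        conn.execute(
            "INSERT INTO commodity (title, price) VALUES (?, ?)",
            (item["title"], item["price"]),
        )


def process_item(item):
    # The pipeline hands the item to the worker and returns immediately,
    # so the crawl is never blocked waiting on the database.
    return executor.submit(insert_item, item)


future = process_item({"title": "example commodity", "price": 19.9})
future.result()  # wait here only so the demo can confirm the row landed
rows = conn.execute("SELECT title, price FROM commodity").fetchall()
```

The key property, in both this sketch and the Twisted version, is that `process_item` returns without waiting for the insert, so scraping throughput is decoupled from database latency.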
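The analysis stage can be sketched with `pandas.read_sql`. Here an in-memory sqlite3 table with invented sample rows stands in for the project's MySQL store; against MySQL the same call would take a pymysql/SQLAlchemy connection instead.

```python
import sqlite3

import pandas as pd

# Stand-in for the MySQL table of scraped Taobao commodity records.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE commodity (title TEXT, price REAL, sales INTEGER)")
conn.executemany(
    "INSERT INTO commodity VALUES (?, ?, ?)",
    [("a", 10.0, 500), ("b", 20.0, 300), ("c", 30.0, 100)],  # illustrative data
)
conn.commit()

# Read the stored data back into a DataFrame, as described above.
df = pd.read_sql("SELECT * FROM commodity", conn)

# A simple correlation analysis between price and sales volume.
corr = df["price"].corr(df["sales"])
```

From the same DataFrame, matplotlib can plot the distributions, and the sklearn/Gensim/Keras steps (classification, clustering) would take `df`'s numeric columns as their input features.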