Web Development / Web Spider
小铭同学/WorkAggregation
53
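A data-driven job-posting aggregation system for the internet industry, with crawling, analysis, visualization, and interactive features.
Python | Web Spider | over 1 year ago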
tansty/CSDN-spider
50
Crawls CSDN articles and converts them to Markdown.
Python | Web Spider | 5 years ago
梓泉没秃头/租房爬虫
48
A crawler for rental housing listings.
Python | Web Spider | over 5 years ago
好穷小子/影视资源库(站点+采集)
45
Written in Python on the Tornado framework, with a MySQL database accessed through the peewee library; ships with its own web crawler.
Python | Web Spider | over 5 years ago
dwbmio/scrapy_proj
42
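Since this is on oschina, everything is written in Chinese :) It started when I found a site with a huge number of Kindle resources and wanted to grab them all for myself, so naturally I used Scrapy. Scrapy: required. BeautifulSoup: required. MongoDB: install depending on the storage method. A Python novice getting started.
Python | Web Spider | over 6 years ago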
kzeng/picpicker
41
A small program that fetches book cover images by ISBN.
Python | Web Spider | 8 years ago
lidunwei/Medical-resources
39
Collects and crawls open-source medical knowledge.
Python | Bio/Medical, Web Spider | 5 years ago
JIANGWL/ZhihuAnalyse
36
Analysis of crawled Zhihu user data.
Python | Web Spider | almost 8 years ago
五十风/BeiJingSubwayFlows
33
Beijing subway passenger flow statistics (Python crawler plus JavaScript charts).
Python | Web Spider | over 5 years ago
qchats/GetZPInfo
31
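GetZPInfo, a job-posting scraping tool. It crawls job postings from a human-resources website and forwards them to an LED bar display connected over a local serial port.
Python | Web Spider | over 12 years ago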
king/news_spider
30
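Crawls data from major news sites, including Shenzhen News (深圳新闻网) and Wall Street news (华尔街新闻). The script has collected at least tens of thousands of records so far and supports incremental updates. Feel free to use it.
Python | Web Spider | over 6 years ago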
liinux/WebScraping
25
Source code for the book "Web Scraping with Python" (Chinese title: 《用Python写网络爬虫》).
Python | Web Spider | almost 9 years ago
mrwang1992/doubangroupspider
20
A Scrapy crawler project for learning web crawling; pushed to OSChina to learn Git along the way.
Python | Web Spider | 10 years ago
XksA-me/拉钩网数据爬虫
20
Crawls Lagou (拉勾网) data, then analyzes and visualizes it to show which city best suits your field and which positions are most in demand.
Python | Web Spider | over 7 years ago
嗝嗝/FoolSpider
15
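A hand-written, foolproof crawler for Python 3.6 with configurable links, proxies, page numbers, database population, and proxy IPs. It can scrape sites such as Tmall, JD.com, and Huaban. Feedback and improvements are welcome.
Python | Web Spider | almost 8 years ago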
Trending Projects
沈阳程序员/Scrapy-Python
130
Scrapy: scraping with a website crawler framework library.
shengqiangzhang/examples-of-web-crawlers
589
Some fun Python crawler examples that are friendly to beginners, mainly crawling Taobao, Tmall, WeChat, Douban, QQ, and similar sites.
AJay13/ECommerceCrawlers
5.2K
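Hands-on crawlers for a variety of websites and e-commerce data. Includes: Taobao products, WeChat official accounts, Dianping, job sites, Xianyu, Alibaba tasks, a Scrapy crawler for cnblogs (博客园), Weibo, Baidu Tieba, Douban Movies, Baotu (包图网), Quanjing (全景网), Douban Music, a provincial drug administration, Sohu News, text collection for machine learning, FOFA asset collection, Autohome, the National Bureau of Statistics, Baidu keyword index counts, spider pan-directory (蜘蛛泛目录), Toutiao, and Douban film reviews. WeChat crawler showcase project: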
Wyatt/Bilibili-Crawler
1
A Python-based Bilibili video downloader that supports video search, automatic best-quality selection, and direct download.
mktime/scrapy-douban-group
359
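Learn how to use Scrapy to crawl information from the web through a real project. Using Douban groups as the example, it crawls the images in a group, saves the related data to MongoDB, and downloads the images locally.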
liweimin/爬虫代码片段 拼多多,团油,抖店
55
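Pinduoduo store order collection; Caizhuang (彩妆网) product collection; Tuanyou (团油) gas station fuel price collection; Douyin store data collection; Wangyue (往约) app data collection; Kuaishou batch video upload; Douyin batch video upload; asynchronous photo and avatar collection; Agricultural Bank of China login; DingTalk business leads; timed keypresses for Ximei (西煤) trading; equipment association personnel qualifications. liweimin@taiyuan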
惊鸿一回车/WeChat_Article
215
Crawls WeChat official account articles.
isyuu/wxhub
345
WeChat official account articles: unrestricted scraping.
resolvewang/WeiboSpider
363
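A distributed Weibo crawler. It collects Weibo user profiles, posts, comments, and reposts. It currently focuses on Weibo data collection itself and is iterating quickly. If you find it helpful, please give it a star on GitHub; the copy on OSC may no longer be updated.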
JIANGWL/ZhihuSpider
508
A multithreaded Zhihu user crawler, based on Python 3.