文章詳情頁

python 爬取免費簡歷模板網站的示例

瀏覽：2日期：2022-07-09 17:14:52

代碼

# 免費的簡歷模板進行爬取本地保存 # http://sc.chinaz.com/jianli/free.html# http://sc.chinaz.com/jianli/free_2.htmlimport requestsfrom lxml import etreeimport osdirName = ’./resumeLibs’if not os.path.exists(dirName): os.mkdir(dirName)headers = { ’User-Agent’:’Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/85.0.4183.83 Safari/537.36’}url = ’http://sc.chinaz.com/jianli/free_%d.html’for page in range(1,2): if page == 1: new_url = ’http://sc.chinaz.com/jianli/free.html’ else: new_url = format(url%page) page_text = requests.get(url=new_url,headers=headers).text tree = etree.HTML(page_text) a_list = tree.xpath(’//div[@id='container']/div/p/a’) for a in a_list: a_src = a.xpath(’./@href’)[0] a_title = a.xpath(’./text()’)[0] a_title = a_title.encode(’iso-8859-1’).decode(’utf-8’) # 爬取下載頁面 page_text = requests.get(url=a_src,headers=headers).text tree = etree.HTML(page_text) dl_src = tree.xpath(’//div[@id='down']/div[2]/ul/li[8]/a/@href’)[0]resume_data = requests.get(url=dl_src,headers=headers).content resume_name = a_title resume_path = dirName + ’/’ + resume_name + ’.rar’ with open(resume_path,’wb’) as fp: fp.write(resume_data) print(resume_name,’下載成功!’)

以上就是python 爬取免費簡歷模板網站的示例的詳細內容，更多關于python 爬取網站的資料請關注好吧啦網其它相關文章！

Python 編程

上一條：Python日志器使用方法及原理解析下一條：python如何提升爬蟲效率

相關文章：

1. Java8內存模型PermGen Metaspace實例解析2. Spring security 自定義過濾器實現Json參數傳遞并兼容表單參數(實例代碼)3. ASP.NET MVC使用正則表達式驗證手機號碼4. 一文搞懂 parseInt()函數異常行為5. python wsgiref源碼解析6. python學習之plot函數的使用教程7. python利用paramiko實現交換機巡檢的示例8. python中用Scrapy實現定時爬蟲的實例講解9. 聊聊python在linux下與windows下導入模塊的區別說明10. python 實現關聯規則算法Apriori的示例

排行榜

					
					Spring security 自定義過濾器實現Json參數傳遞并兼容表單參數(實例代碼)
Java8內存模型PermGen Metaspace實例解析
IDEA 去除 mybatis.xml 文件黃色警告的圖文教程
Flex挑戰Java和.NET Adobe能否再度崛起
python不到50行代碼完成了多張excel合并的實現示例
基于vue實現簡易打地鼠游戲
ASP.NET MVC使用正則表達式驗證手機號碼
python tkinter實現下載進度條及抖音視頻去水印原理
Python通過Pillow實現圖片對比
JavaScript 防篡改對象的用法示例
解決Django Haystack全文檢索為空的問題