PHP: fetching web page source (2019: a third-party Python library introduced through example code, with figures)

Published by 优采云: 2022-03-11 15:06


Python requests code example: fetching the featured text and image from ONE

Updated: 2019-11-04 09:39:45 Author: 常凡

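This article walks through a Python requests example that fetches the featured text and image from ONE. The sample code is shown in full, so it should be a useful reference for study or work. Readers who need it can follow along below.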

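requests is a third-party Python library: an HTTP library built on urllib3 (not the standard-library urllib) and released under the Apache 2.0 open-source license. It is more convenient than urllib, saves a lot of boilerplate, and fully covers what HTTP testing needs. The rest of this article records how to use it: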
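As a minimal sketch of the basic call pattern, the helper below wraps `requests.get` with a timeout and an HTTP status check. The function name `fetch_html` and the 10-second default are assumptions made for illustration; they do not come from the original example.

```python
import requests


def fetch_html(url, timeout=10.0):
    """Return the decoded body of `url`.

    Raises requests.exceptions.RequestException on connection problems,
    timeouts, malformed URLs, or HTTP error statuses (4xx/5xx).
    """
    # Without a timeout, requests may wait indefinitely for a response.
    response = requests.get(url, timeout=timeout)
    # Turn 4xx/5xx responses into exceptions instead of silently continuing.
    response.raise_for_status()
    # .text is the decoded body; .content would give the raw bytes.
    return response.text
```

Calling `fetch_html("http://wufazhuce.com/")` should return the home page HTML as a string. The example in this article uses `response.content` (raw bytes) instead and lets BeautifulSoup work out the encoding.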

import requests
from bs4 import BeautifulSoup  # the "lxml" parser used below needs the lxml package installed

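# Image download function
def download_img(url, name):
    """
    Download the image at the given URL.
    url: URL of the image
    name: file name (without extension) to save the image under
    """
    try:
        response = requests.get(url)
        f_img = response.content
        # Hard-coded directory from the original example; change it to one that exists on your machine
        path = r'C:\Users\86131\Desktop\itchat\send_file\images\%s.jpg' % (name)
        with open(path, "wb") as f:
            f.write(f_img)
    except Exception as e:
        print("--------- URL error ---------", e)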
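The hard-coded Windows path in `download_img` only works on the original author's machine. One possible alternative, sketched here with the standard-library `pathlib`, builds the target path from a configurable directory and replaces characters that are not valid in Windows file names. The helper name `build_image_path`, the default directory `images`, and the exact set of characters replaced are assumptions for illustration.

```python
import re
from pathlib import Path


def build_image_path(name, directory="images"):
    """Return a Path of the form <directory>/<name>.jpg with a sanitized file name."""
    # Replace characters that Windows rejects in file names with an underscore.
    safe_name = re.sub(r'[\\/:*?"<>|]+', "_", name).strip()
    # Collapse runs of whitespace into a single space.
    safe_name = re.sub(r"\s+", " ", safe_name)
    return Path(directory) / f"{safe_name}.jpg"


print(build_image_path("4 Nov/2019", "out"))
```

Before writing the file, the directory can be created with `path.parent.mkdir(parents=True, exist_ok=True)`, and the resulting `Path` can be passed straight to `open(path, "wb")`.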

url_list = []
f = requests.get("http://wufazhuce.com/")
# Print the page content
# print(f.content.decode())
soup = BeautifulSoup(f.content, "lxml")
try:
    first_div = soup.find("div", attrs={'id': 'main-container'}).find('div', attrs={'class': 'carousel-inner'})
    a_all = first_div.find_all('a')
    for i in a_all:
        url_list.append(i.attrs['href'])
except Exception as e:
    print("--------- Error ---------", e)
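The link extraction step can be tried without touching the network by running it against an inline HTML fragment. The fragment below is a made-up stand-in that only mimics the `main-container` / `carousel-inner` nesting used by the lookups above; the real page may differ. The function name `extract_carousel_links`, the sample URLs, and the use of `urljoin` to resolve relative links are likewise assumptions. The built-in `html.parser` is used instead of `lxml` so the sketch needs no extra parser package.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# Invented sample markup, not copied from the real site.
SAMPLE_HOME = """
<div id="main-container">
  <div class="carousel-inner">
    <div class="item"><a href="http://example.com/one/1">first</a></div>
    <div class="item"><a href="/one/2">second</a></div>
  </div>
</div>
"""


def extract_carousel_links(html, base_url):
    """Return the absolute URL of every <a href> found inside the carousel."""
    soup = BeautifulSoup(html, "html.parser")
    # A CSS selector covering the same nesting as the find() calls;
    # select() returns an empty list when nothing matches.
    anchors = soup.select("div#main-container div.carousel-inner a[href]")
    # urljoin leaves absolute links untouched and resolves relative ones.
    return [urljoin(base_url, a["href"]) for a in anchors]


links = extract_carousel_links(SAMPLE_HOME, "http://example.com/")
print(links)
```

Because `select` returns an empty list rather than `None`, a page without the expected markup yields no links instead of raising `AttributeError`.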

# Open the first recommended page linked from the ONE home page
if not url_list:
    raise SystemExit("No links were collected from the home page")
f_1 = requests.get(url_list[0])
# Print the page content
# print(f_1.content.decode())
soup_1 = BeautifulSoup(f_1.content, "lxml")
try:
    second_div = soup_1.find("div", attrs={'id': 'main-container'}).find('div', attrs={'class': 'one-cita-wrapper'})
    third_div = soup_1.find("div", attrs={'id': 'main-container'}).find('div', attrs={'class': 'one-imagen'})
    # Get the date values
    now_month = second_div.find('p', attrs={'class': 'may'}).text
    now_one_day = second_div.find('p', attrs={'class': 'dom'}).text
    # Get the image URL
    img_url = third_div.find('img').attrs['src']
    # Get the sentence and strip the surrounding whitespace
    one_text = second_div.find("div", attrs={'class': 'one-cita'}).text.strip()
    # Join the date parts
    now_day = now_one_day + ' ' + now_month
    # Call the function to download the image
    download_img(img_url, now_day)
except Exception as e:
    print("--------- Error ---------", e)
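Catching every `Exception` hides which lookup actually failed. A possible alternative, sketched below, checks each lookup explicitly and reports the missing element. The HTML fragment is invented to mirror the class names used above (`one-cita-wrapper`, `one-imagen`, `dom`, `may`, `one-cita`); the function name `parse_daily_page`, the dictionary keys, and the sample values are assumptions, and `html.parser` stands in for `lxml`.

```python
from bs4 import BeautifulSoup

# Invented sample markup, not copied from the real site.
SAMPLE_DETAIL = """
<div id="main-container">
  <div class="one-imagen"><img src="http://example.com/img/1.jpg"></div>
  <div class="one-cita-wrapper">
    <div class="one-cita">  A sample sentence.  </div>
    <p class="dom">4</p>
    <p class="may">Nov 2019</p>
  </div>
</div>
"""


def parse_daily_page(html):
    """Extract the date, image URL and text; raise ValueError naming a missing element."""
    soup = BeautifulSoup(html, "html.parser")

    def require(selector):
        # select_one returns None when nothing matches, so fail with a clear message.
        node = soup.select_one(selector)
        if node is None:
            raise ValueError(f"element not found: {selector}")
        return node

    day = require("#main-container .one-cita-wrapper p.dom").get_text(strip=True)
    month = require("#main-container .one-cita-wrapper p.may").get_text(strip=True)
    image = require("#main-container .one-imagen img[src]")
    text = require("#main-container .one-cita-wrapper div.one-cita").get_text(strip=True)
    return {"date": f"{day} {month}", "image_url": image["src"], "text": text}


result = parse_daily_page(SAMPLE_DETAIL)
print(result)
```

With this shape, a change in the page layout surfaces as a `ValueError` that names the selector that no longer matches, rather than a generic error line.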

That is all for this article. We hope it helps with your studies, and we hope you will continue to support Scripting Home.
