侯体宗的博客
  • 首页
  • Hyperf版
  • beego仿版
  • 人生(杂谈)
  • 技术
  • 关于我
  • 更多分类
    • 文件下载
    • 文字修仙
    • 中国象棋ai
    • 群聊
    • 九宫格抽奖
    • 拼图
    • 消消乐
    • 相册

Python实时数据采集-新型冠状病毒

Python  /  管理员 发布于 7年前   215


Python实时数据采集-新型冠状病毒

源代码 来源:https://github.com/Programming-With-Love/2019-nCoV

疫情数据时间为:2020.2.1

项目相关截图:

全国数据展示

国内数据展示

国外数据展示

查看指定区域详细数据

源代码,注意安装所需模块(例如 pip install 模块名)

import requestsimport refrom bs4 import BeautifulSoupfrom time import sleepimport jsonfrom prettytable import ALLfrom prettytable import PrettyTablehubei = {}guangdong = {}zhejiang = {}beijing = {}shanghai = {}hunan = {}anhui = {}chongqing = {}sichuan = {}shandong = {}guangxi = {}fujian = {}jiangsu = {}henan = {}hainan = {}tianjin = {}jiangxi = {}shanxi1 = {} # 陕西guizhou = {}liaoning = {}xianggang = {}heilongjiang = {}aomen = {}xinjiang = {}gansu = {}yunnan = {}taiwan = {}shanxi2 = {} # 山西jilin = {}hebei = {}ningxia = {}neimenggu = {}qinghai = {} # nonexizang = {} # noneprovinces_idx = [hubei, guangdong, zhejiang, chongqing, hunan, anhui, beijing,     shanghai, henan, guangxi, shandong, jiangxi, jiangsu, sichuan,     liaoning, fujian, heilongjiang, hainan, tianjin, hebei, shanxi2,     yunnan, xianggang, shanxi1, guizhou, jilin, gansu, taiwan,     xinjiang, ningxia, aomen, neimenggu, qinghai, xizang]map = {    '湖北':0, '广东':1, '浙江':2, '北京':3, '上海':4, '湖南':5, '安徽':6, '重庆':7,    '四川':8, '山东':9, '广西':10, '福建':11, '江苏':12, '河南':13, '海南':14,    '天津':15, '江西':16, '陕西':17, '贵州':18, '辽宁':19, '香港':20, '黑龙江':21,    '澳门':22, '新疆':23, '甘肃':24, '云南':25, '台湾':26, '山西':27, '吉林':28,    '河北':29, '宁夏':30, '内蒙古':31, '青海':32, '西藏':33}def getTime(text):    TitleTime = str(text)    TitleTime = re.findall('<span>(.*?)</span>', TitleTime)    return TitleTime[0]def getAllCountry(text):    AllCountry = str(text)    AllCountry = AllCountry.replace("[<p class=\"confirmedNumber___3WrF5\"><span class=\"content___2hIPS\">", "")    AllCountry = AllCountry.replace("<span style=\"color: #4169e2\">", "")    AllCountry = re.sub("</span>", "", AllCountry)    AllCountry = AllCountry.replace("</p>]", "")        AllCountry = AllCountry.replace("<span style=\"color: rgb(65, 105, 226);\">", "")    AllCountry = re.sub("<span>", "", AllCountry)    AllCountry = re.sub("<p>", "", AllCountry)    AllCountry = re.sub("</p>", "", AllCountry)    return AllCountry def query(province):    table = PrettyTable(['地区', '确诊', '死亡', '治愈'])    for (k, v) in province.items():        name = k        table.add_row([name, v[0] if v[0] != 0 else '-', v[1] if v[1] != 0 else '-', v[2] if v[2] != 0 else '-'])    if len(province.keys()) != 0:        print(table)    else:        print("暂无")def getInfo(text):    text = str(text)    text = re.sub("<p class=\"descText___Ui3tV\">", "", text)    text = re.sub("</p>", "", text)    return textdef is_json(json_str):    try:        json.loads(json_str)    except ValueError:        return False    return Truedef ff(str, num):    return str[:num] + str[num+1:]        def main():    url = "https://3g.dxy.cn/newh5/view/pneumonia"    try:        headers = {}        headers['user-agent'] = 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/70.0.3538.77 Safari/537.36' #http头大小写不敏感        headers['accept'] = 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8'        headers['Connection'] = 'keep-alive'        headers['Upgrade-Insecure-Requests'] = '1'        r = requests.get(url, headers=headers)        r.raise_for_status()        r.encoding = r.apparent_encoding        soup = BeautifulSoup(r.text,'lxml')        table = PrettyTable(['地区', '确诊', '死亡', '治愈'])        table.hrules = ALL        #### 截至时间        # TitleTime = getTime(soup.select('.title___2d1_B'))    print()        # print("  ",TitleTime + "\n")        while True:r = requests.get("https://service-f9fjwngp-1252021671.bj.apigw.tencentcs.com/release/pneumonia")json_str = json.loads(r.text)if json_str['error'] == 0:    break        print("==================================全国数据==================================")        print()    print("     确诊 " + str(json_str['data']['statistics']['confirmedCount']) + " 例"+ "       " + "疑似 " + str(json_str['data']['statistics']['suspectedCount']) + " 例"+ "       " + "死亡" + str(json_str['data']['statistics']['deadCount']) + " 例"+ "       " + "治愈" + str(json_str['data']['statistics']['curedCount']) + " 例\n")        print("==================================相关情况==================================")        print()        print("传染源:" + json_str['data']['statistics']['infectSource'])        print("病毒:" + json_str['data']['statistics']['virus'])        print("传播途径:" + json_str['data']['statistics']['passWay'])        print(json_str['data']['statistics']['remark1'])        print(json_str['data']['statistics']['remark2'] + "\n")        print("==================================国内情况==================================")        print()    json_provinces = re.findall("{\"provinceName\":(.*?)]}", str(soup))        idx = 0        for province in json_provinces:if is_json(province):    passelse:    province = "{\"provinceName\":" + province + "]}"    province = json.loads(province)    province_name = province['provinceShortName'] if province['provinceShortName'] != 0 else '-'confirmed = province['confirmedCount'] if province['confirmedCount'] != 0 else '-'suspected = province['suspectedCount'] if province['suspectedCount'] != 0 else '-'cured = province['curedCount'] if province['curedCount'] != 0 else '-'dead = province['deadCount'] if province['deadCount'] != 0 else '-'table.add_row([province_name, confirmed, dead, cured])map[province_name] = idxidx = idx + 1for city in province['cities']:    provinces_idx[map[province_name]][city['cityName']] = [city['confirmedCount'], city['deadCount'], city['curedCount']]        print(table)print()        print("==================================国外情况==================================")        print()        json_provinces = str(re.findall("\"id\":949(.*?)]}", str(soup)))        json_provinces = json_provinces[:1] + "{\"id\":949" + json_provinces[2:]        json_provinces = json_provinces[:len(json_provinces) - 2] + json_provinces[len(json_provinces) - 1:]        provinces = json.loads(json_provinces)        table = PrettyTable(['地区', '确诊', '死亡', '治愈'])        for province in provinces:confirmed = province['confirmedCount'] if province['confirmedCount'] != 0 else '-'dead = province['deadCount'] if province['deadCount'] != 0 else '-'cured = province['curedCount'] if province['curedCount'] != 0 else '-'table.add_row([province['provinceName'], confirmed, dead, cured])    print(table)        print()    print("==================================最新消息==================================")        print()    idx = 0        for news in json_str['data']['timeline']:if idx == 5:    breakprint(news['pubDateStr'] + "  " + news['title'])idx = idx + 1    print()        key = input("请输入您想查询详细信息的省份,例如 湖北\n")        print()        if key in map.keys():query(provinces_idx[map[key]])        else:print("暂无相关信息")        print("\n欢迎提出各种意见")    except:        print("连接失败")if __name__ == '__main__':    main()    sleep(30)

最后,祝大家百毒不侵,中国加油!!一定能够度过难关!!

以上就是Python实时数据采集-新型冠状病毒的详细内容,更多请关注其它相关文章!


  • 上一条:
    python3.8.0安装教程
    下一条:
    python统计不同字符的个数
  • 昵称:

    邮箱:

    0条评论 (评论内容有缓存机制,请悉知!)
    最新最热
    • 分类目录
    • 人生(杂谈)
    • 技术
    • linux
    • Java
    • php
    • 框架(架构)
    • 前端
    • ThinkPHP
    • 数据库
    • 微信(小程序)
    • Laravel
    • Redis
    • Docker
    • Go
    • swoole
    • Windows
    • Python
    • 苹果(mac/ios)
    • 相关文章
    • 在python语言中Flask框架的学习及简单功能示例(0个评论)
    • 在Python语言中实现GUI全屏倒计时代码示例(0个评论)
    • Python + zipfile库实现zip文件解压自动化脚本示例(0个评论)
    • python爬虫BeautifulSoup快速抓取网站图片(1个评论)
    • vscode 配置 python3开发环境的方法(0个评论)
    • 近期文章
    • 在go中实现一个常用的先进先出的缓存淘汰算法示例代码(0个评论)
    • 在go+gin中使用"github.com/skip2/go-qrcode"实现url转二维码功能(0个评论)
    • 在go语言中使用api.geonames.org接口实现根据国际邮政编码获取地址信息功能(1个评论)
    • 在go语言中使用github.com/signintech/gopdf实现生成pdf分页文件功能(0个评论)
    • gmail发邮件报错:534 5.7.9 Application-specific password required...解决方案(0个评论)
    • 欧盟关于强迫劳动的规定的官方举报渠道及官方举报网站(0个评论)
    • 在go语言中使用github.com/signintech/gopdf实现生成pdf文件功能(0个评论)
    • Laravel从Accel获得5700万美元A轮融资(0个评论)
    • 在go + gin中gorm实现指定搜索/区间搜索分页列表功能接口实例(0个评论)
    • 在go语言中实现IP/CIDR的ip和netmask互转及IP段形式互转及ip是否存在IP/CIDR(0个评论)
    • 近期评论
    • 122 在

      学历:一种延缓就业设计,生活需求下的权衡之选中评论 工作几年后,报名考研了,到现在还没认真学习备考,迷茫中。作为一名北漂互联网打工人..
    • 123 在

      Clash for Windows作者删库跑路了,github已404中评论 按理说只要你在国内,所有的流量进出都在监控范围内,不管你怎么隐藏也没用,想搞你分..
    • 原梓番博客 在

      在Laravel框架中使用模型Model分表最简单的方法中评论 好久好久都没看友情链接申请了,今天刚看,已经添加。..
    • 博主 在

      佛跳墙vpn软件不会用?上不了网?佛跳墙vpn常见问题以及解决办法中评论 @1111老铁这个不行了,可以看看近期评论的其他文章..
    • 1111 在

      佛跳墙vpn软件不会用?上不了网?佛跳墙vpn常见问题以及解决办法中评论 网站不能打开,博主百忙中能否发个APP下载链接,佛跳墙或极光..
    • 2016-10
    • 2016-11
    • 2018-04
    • 2020-03
    • 2020-04
    • 2020-05
    • 2020-06
    • 2022-01
    • 2023-07
    • 2023-10
    Top

    Copyright·© 2019 侯体宗版权所有· 粤ICP备20027696号 PHP交流群

    侯体宗的博客