推荐学习书目
Learn Python the Hard Way
Python Sites
PyPI - Python Package Index
http://diveintopython.org/toc/index.html
Pocoo
值得关注的项目
PyPy
Celery
Jinja2
Read the Docs
gevent
pyenv
virtualenv
Stackless Python
Beautiful Soup
结巴中文分词
Green Unicorn
Sentry
Shovel
Pyflakes
pytest
Python 编程
pep8 Checker
Styles
PEP 8
Google Python Style Guide
Code Style from The Hitchhiker's Guide
donglongtu
V2EX  ›  Python

Python 获取 HTTP 请求的状态码(200,404 等)

  •  
  •   donglongtu · Jun 28, 2017 · 3266 views
    This topic created in 3264 days ago, the information mentioned may be changed or developed.

    Python 获取 HTTP 请求的状态码(200,404 等),不访问整个页面源码,那样太浪费资源:

    输入:www.v2ex.com 输出:200
    输入:www.v2ex.com/nonexistant 输出:404
    
    revotu
        1
    revotu  
       Jun 28, 2017
    Python 实用脚本清单 : http://www.revotu.com/python-practical-script-list.html

    http 不只有 get 方法(请求头部+正文),还有 head 方法,只请求头部。


    import httplib

    def get_status_code(host, path="/"):
    """ This function retreives the status code of a website by requesting
    HEAD data from the host. This means that it only requests the headers.
    If the host cannot be reached or something else goes wrong, it returns
    None instead.
    """
    try:
    conn = httplib.HTTPConnection(host)
    conn.request("HEAD", path)
    return conn.getresponse().status
    except StandardError:
    return None

    print get_status_code("www.v2ex.com") # prints 200
    print get_status_code("www.v2ex.com", "/nonexistant") # prints 404
    bolide2005
        2
    bolide2005  
       Jun 28, 2017
    >>> import requests
    >>> response = requests.get("http://www.baidu.com")
    >>> print response.status_code
    200
    dd99iii
        3
    dd99iii  
       Jun 30, 2017 via iPhone
    def get_status_code(url):
    r = requests.head(url)
    return r.status_code
    About   ·   Help   ·   Advertise   ·   Blog   ·   API   ·   FAQ   ·   Solana   ·   2492 Online   Highest 6679   ·     Select Language
    创意工作者们的社区
    World is powered by solitude
    VERSION: 3.9.8.5 · 34ms · UTC 00:51 · PVG 08:51 · LAX 17:51 · JFK 20:51
    ♥ Do have faith in what you're doing.