scikit-learn: plot_stock_market.py fails constantly

The example has been failing for several days and we cannot see the latest doc, so I raise it here. error message from Circle

Unexpected failing examples:
/home/circleci/project/examples/applications/plot_stock_market.py failed leaving traceback:
Traceback (most recent call last):
  File "/home/circleci/project/examples/applications/plot_stock_market.py", line 225, in <module>
    symbol, start_date, end_date))
  File "/home/circleci/project/examples/applications/plot_stock_market.py", line 85, in wrapper
    return f(*args, **kwargs)
  File "/home/circleci/project/examples/applications/plot_stock_market.py", line 117, in quotes_historical_google
    response = urlopen(url)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 154, in urlopen
    return opener.open(url, data, timeout)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 435, in open
    response = meth(req, response)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 548, in http_response
    'http', request, response, code, msg, hdrs)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 473, in error
    return self._call_chain(*args)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 407, in _call_chain
    result = func(*args)
  File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 556, in http_error_default
    raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
HTTPError: HTTP Error 403: Forbidden

Is it related to #9749? Anything to improve in scikit-learn? ping @jnothman @lesteve

About this issue

  • Original URL
  • State: closed
  • Created 6 years ago
  • Comments: 21 (21 by maintainers)

Most upvoted comments

I have a branch where I got the data from quandl and alphavantage (neither of them have the complete coverage of the stocks we are using in plot_stock_market.py): https://github.com/scikit-learn/scikit-learn/compare/master...lesteve:fix-plot-stock-market

I would be in favour of just hosting the csv in the github repo like I did in my branch. An alternative is to put them on figshare but then the relationship between the stock symbol and the URL is not straightforward. All the CSV files take about 1.5MB.

The plot looks visually different but broadly similar in terms of cluster structure. There are tiny differences in clusters, for example Boeing does not seem to end up in the same cluster. I am not sure what to make of the differences I have to say. IIRC there were visual differences as well when we switch from Yahoo to Google.

My branch: my_branch

Last succesful build in master: master