scikit-learn: plot_stock_market.py fails constantly
The example has been failing for several days and we cannot see the latest doc, so I raise it here. error message from Circle
Unexpected failing examples:
/home/circleci/project/examples/applications/plot_stock_market.py failed leaving traceback:
Traceback (most recent call last):
File "/home/circleci/project/examples/applications/plot_stock_market.py", line 225, in <module>
symbol, start_date, end_date))
File "/home/circleci/project/examples/applications/plot_stock_market.py", line 85, in wrapper
return f(*args, **kwargs)
File "/home/circleci/project/examples/applications/plot_stock_market.py", line 117, in quotes_historical_google
response = urlopen(url)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 154, in urlopen
return opener.open(url, data, timeout)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 435, in open
response = meth(req, response)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 548, in http_response
'http', request, response, code, msg, hdrs)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 473, in error
return self._call_chain(*args)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 407, in _call_chain
result = func(*args)
File "/home/circleci/miniconda/envs/testenv/lib/python2.7/urllib2.py", line 556, in http_error_default
raise HTTPError(req.get_full_url(), code, msg, hdrs, fp)
HTTPError: HTTP Error 403: Forbidden
Is it related to #9749? Anything to improve in scikit-learn? ping @jnothman @lesteve
About this issue
- Original URL
- State: closed
- Created 6 years ago
- Comments: 21 (21 by maintainers)
I have a branch where I got the data from quandl and alphavantage (neither of them have the complete coverage of the stocks we are using in plot_stock_market.py): https://github.com/scikit-learn/scikit-learn/compare/master...lesteve:fix-plot-stock-market
I would be in favour of just hosting the csv in the github repo like I did in my branch. An alternative is to put them on figshare but then the relationship between the stock symbol and the URL is not straightforward. All the CSV files take about 1.5MB.
The plot looks visually different but broadly similar in terms of cluster structure. There are tiny differences in clusters, for example Boeing does not seem to end up in the same cluster. I am not sure what to make of the differences I have to say. IIRC there were visual differences as well when we switch from Yahoo to Google.
My branch:
Last succesful build in master: