dvc.org: regular: fix expired and broken links
See https://github.com/iterative/dvc.org/actions/workflows/link-check-all.yml
Run iterative/link-check.action@v0.7
* content/blog/2020-07-16-devops-for-data-scientists.md
- http://engineering.microsoft.com/devops/ (404)
* content/blog/2020-11-11-november-20-dvc-heartbeat.md
- https://torontomachinelearning.com/ (409)
- https://torontomachinelearning.com/ (409)
* content/blog/2020-12-30-december-20-community-gems.md
- https://github.com/iterative/cml/blob/master/docker/Dockerfile (404)
* content/blog/2021-02-22-cml-runner-prerelease.md
- https://github.com/iterative/cml/blob/master/docker/Dockerfile (404)
* content/blog/2021-04-16-april-21-dvc-heartbeat.md
- https://weworkremotely.com/remote-jobs/iterative-senior-frontend-engineer (404)
* content/docs/cml/self-hosted-runners.md
- https://github.com/iterative/cml/blob/master/docker/Dockerfile (404)
* content/docs/cml/usage.md
- https://github.com/iterative/cml/blob/master/docker/Dockerfile (404)
About this issue
- Original URL
- State: closed
- Created 3 years ago
- Reactions: 2
- Comments: 15 (15 by maintainers)
Commits related to this issue
- fix outdated CML links - part of #2486 — committed to iterative/dvc.org by casperdcl 3 years ago
- fix outdated CML links (#2488) - part of #2486 — committed to iterative/dvc.org by casperdcl 3 years ago
- fix outdated CML links (#2488) - part of #2486 — committed to iterative/dvc.org by casperdcl 3 years ago
- hotfix: content/docs/... broken links per https://github.com/iterative/dvc.org/issues/2486#issuecomment-1067192170 — committed to iterative/dvc.org by jorgeorpinel 2 years ago
- hotfix: content/docs/... broken links (#3355) per https://github.com/iterative/dvc.org/issues/2486#issuecomment-1067192170 — committed to iterative/dvc.org by jorgeorpinel 2 years ago
- hotfix: content/docs/... broken links (#3355) per https://github.com/iterative/dvc.org/issues/2486#issuecomment-1067192170 — committed to iterative/dvc.org by jorgeorpinel 2 years ago
- Update mmcv.md Fix broken link, per https://github.com/iterative/dvc.org/issues/2486#issuecomment-1226624396 — committed to iterative/dvc.org by daavoo 2 years ago
- Update mmcv.md (#3895) Fix broken link, per https://github.com/iterative/dvc.org/issues/2486#issuecomment-1226624396 — committed to iterative/dvc.org by daavoo 2 years ago
Thanks @rogermparent , ping me if you have doubts about certain links, I can help with them in Slack. Should be quick.
yes, websites (and everyone tbh) is free to take and edit any blog post and create a PR
yep, it’s def p2. We should keep our home clean though. Together. If no one pays attention to anything website quality will deteriorate quick and this is not acceptable. There should be an easy process to do this. And if it’s done more or less regularly I doubt it would be taking >1-5minutes a week from anyone.
One suggestion btw is to remove or archive (put a message that it’s outdated, not maintained, remove from search, and the landing page) older blog posts, especially things like gems, heartbeats. Eventually garbage collect them completely. This was we can put an exception into link checker to avoid these outdated posts at all.
I suppose it would depend on the definition of “archive”, but most definitions I can imagine are things we can do.
I’m thinking we want the latter, so I’ll default to thinking about link check improvements to enable that. Adding in the ability to specify input files is an easy start, we’ll need at least that even if we have the date checking done by GitHub Actions.
If we want a post archiving feature that’s more involved I can do that too, but it seems we’re mostly just talking about link check here.
And to add more color to this, some actual stats, top visited pages, according to plausible:
some of them are quite old (shtab release!), some of them will be used as tutorials (GCP, AWS - we should have done this in docs not as blogs in the first place, and I would love dev rel team to also participate in this - it’s clear that those topic are important)
everything else is <100 visits and which are very old we can start archiving at some point.
@rogermparent per @shcheklein suggestion, to help with this one, is it possible to set up the blog to continually archive (somehow as opposed to just trash. There may be reasons we want to revisit these) Heartbeat and Community Gems posts that are more than a year old?
This should help with link problems, clean up the old differing in style images, and hopefully eliminate most of the noise problem whatever links that show up after that would be worth changing as the material is either more recent or from a tutorial or release.
Old Heartbeat and Gems posts are not really revisited like the release and tutorial posts, so just those two types of posts would drop off.
Maybe @iterative/websites should be free to remove broken links from old blogs as needed so the check stops failing.
I’ll fix the broken links from content. @jendefig what should we do about all the broken links from old blog posts, want to look or re-assign within devrel? Thanks
p.s. latest list: https://github.com/iterative/dvc.org/runs/5530668423?check_suite_focus=true
To my mind it’s more important to have a healthy workforce. Spending time updating links on old blogs that aren’t being looked at very much and links that probably have a low click through rate anyway falls into the P2-nice-to-have category. Our mountain of higher priority tasks is large. This is very low priority for me.
It’s in my back log of to dos in simmer mode.
Ping @jendefig and @daavoo 🙂
The link-check is showing some more broken doc links (Went ahead and tested them to confirm they weren’t false negatives):
cc @iterative/docs
yes, I would maintain. This is good to have a healthy website to my mind.
Seems like the most important ones (last 2) are about CML cc @casperdcl
Do we want to maintain old blogs like this though? I guess these are not so old so why not fix them indeed…