lighthouse: OOM in ImageElements gatherer
We were getting OOMs in LR and I managed to confidently bisect down to the commit where #11188 was merged (it’s core(image-elements): collect CSS sizing, ShadowRoot, & position).
node lighthouse-cli http://cosmetiqon.gr/ --only-audits=unsized-images -G
Here’s one URL where this can sometimes OOM, though I definitely can’t get an OOM locally. I’m not entirely sure which context is getting the OOM… the page or Lighthouse.
I do know that if I comment out these lines…
…the imageGatherer takes 2s instead of 28s.
I attempted to do some memory debugging but didn’t get too far. This still requires a bit of investigation.
About this issue
- State: closed
- Created 4 years ago
- Comments: 16 (6 by maintainers)
isCss is a significant speed up, enough to reland this change. https://github.com/GoogleChrome/lighthouse/issues/11289
A quick followup we will do is sort the elements by image size, then apply a sensible time budget to fetching source rules. https://github.com/GoogleChrome/lighthouse/pull/11340#issuecomment-682259063
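A rough sketch of what "sort by image size, then apply a time budget" could look like. This is not the code from the linked PR: `fetchRulesWithinBudget`, the `fetchSourceRules` callback, the `budgetMs` value, and the use of `displayedWidth`/`displayedHeight` as the size measure are all assumptions made for illustration.

```javascript
/**
 * Hypothetical helper: fetch source rules for the largest images first and
 * stop starting new fetches once the time budget has been spent.
 * @param {Array<{src: string, displayedWidth: number, displayedHeight: number}>} elements
 * @param {(element: object) => Promise<void>} fetchSourceRules assumed callback
 * @param {number} budgetMs total time allowed, in milliseconds
 * @param {() => number} now clock function, injectable for testing
 * @return {Promise<Array<object>>} the elements that were processed
 */
async function fetchRulesWithinBudget(elements, fetchSourceRules, budgetMs, now = Date.now) {
  // Copy before sorting so the caller's array keeps its original order.
  const bySizeDesc = [...elements].sort(
    (a, b) => b.displayedWidth * b.displayedHeight - a.displayedWidth * a.displayedHeight
  );

  const start = now();
  const processed = [];
  for (const element of bySizeDesc) {
    // The budget is only checked between elements; a fetch in flight is not cancelled.
    if (now() - start >= budgetMs) break;
    await fetchSourceRules(element);
    processed.push(element);
  }
  return processed;
}
```

Images that fall outside the budget simply never get source rules fetched, so an audit consuming the result would need to treat those as "unknown" rather than "unsized".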
I did some exploring on this issue, but I couldn’t find (& don’t think I have) access to LR so this is coming from observing what happens on my local machine:
I don’t know if the slowdown from getMatchedStylesForNode has to do with the OOM issue, but my intuition is that they might be two separate things to consider, especially after reading the performance issues previously encountered when using getMatchedStylesForNode.
In the font-size audit, as far as I could tell, we optimize how many times we actually call getMatchedStylesForNode, which is not something I did when I wrote unsized-images, because I didn’t realize how slow getMatchedStylesForNode can be. In order to improve the runtime of unsized-images by reducing calls to getMatchedStylesForNode, one optimization that I think we should include is to change
to
since we don’t currently check CSS background-images in unsized-images anyway, and cssWidth/cssHeight aren’t as relevant to background-images because of background-repeat & parent element sizing.
Additionally, I agree with @patrickhulce about
or other workarounds that can reduce the total calls to getMatchedStylesForNode.
I noticed that a large number of the ImageElements in http://cosmetiqon.gr/ had the same src because they were the default gif for the site’s lazy loaded images. There might be potential here to reduce the calls to getMatchedStylesForNode, e.g. caching the CSS.GetMatchedStylesForNodeResponse for sibling nodes that have the same CSS class (might make OOM worse), or not calling getMatchedStylesForNode on lazy loaded images outside of the viewport (not sure if this is what we’d want to encourage).
As for the OOM issue, I had some leads I could think of:
- Running unsized-images on http://cosmetiqon.gr/, there were a handful of errors like
  that disappear after adding && !element.isCss. Is there a possibility of a memory leak caused by too many errors?
- to
  In the worst case this somehow causes a circular reference, since nodeId was declared earlier, & in the best case this is just a nit.
- async afterPass(passContext, loadData) within image-elements.js
  image-elements.js:
  I JSON.stringified instances of matchedRules and found sizes ranging from ~30000 chars/bytes to ~200000 chars/bytes, with the median falling between ~100000 and ~150000 chars/bytes for a page such as http://cosmetiqon.gr/
  Based off a 150000 byte ballpark for each instance of matchedRules in http://cosmetiqon.gr/, and the fact that it has ~100 ImageElements, we get 150000 * 100 -> ~15MB
  I do not know if this is a reasonable use of memory / cache when running lighthouse or LR, & whether we do save all matchedRules. I’ll check ndb later to see if this happens locally.
Once we have some urls we can start digging in more…
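For reference, the matchedRules size estimate earlier in the thread (about 150000 bytes per instance across ~100 ImageElements) can be reproduced with a small sketch like the one below. `estimateMatchedRulesBytes` is a hypothetical helper, not something in Lighthouse, and serialized JSON size is only a rough proxy for the memory the objects actually occupy.

```javascript
/**
 * Hypothetical helper: sum the UTF-8 size of each matchedRules object
 * once serialized to JSON. A proxy for retained size, not a real measurement.
 * @param {Array<object>} matchedRulesList
 * @return {number} total bytes across all entries
 */
function estimateMatchedRulesBytes(matchedRulesList) {
  return matchedRulesList.reduce(
    (total, rules) => total + Buffer.byteLength(JSON.stringify(rules), 'utf8'),
    0
  );
}

// The ballpark from the thread: ~150000 bytes per instance, ~100 ImageElements.
const bytesPerInstance = 150000;
const imageElementCount = 100;
const totalBytes = bytesPerInstance * imageElementCount; // 15,000,000 bytes
const totalMegabytes = totalBytes / 1e6; // 15 MB
```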
I’d start by adding timing marks around these three areas:
- DOM.pushNodeByPathToFrontend
- CSS.getMatchedStylesForNode
- getEffectiveSizingRule
It’s unfortunate we are working from a devtools node path here. Perhaps there’s a way to grab the DOM snapshot (must verify that the width/height properties from the snapshot don’t include intrinsic image sizes) and then connect that data to the image element we scraped.
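One way the timing marks around the three areas listed above might be collected is a small wrapper that accumulates elapsed time per label. `timed` and `timings` are hypothetical names for this sketch, and the `driver.sendCommand` call in the usage comment is an assumption about how the gatherer issues protocol commands.

```javascript
// Accumulated wall-clock time per label, in milliseconds.
const timings = new Map();

/**
 * Hypothetical helper: run fn, add its duration to the total for label,
 * and pass through its result (or its error).
 * @template T
 * @param {string} label
 * @param {() => Promise<T>|T} fn
 * @return {Promise<T>}
 */
async function timed(label, fn) {
  const start = performance.now();
  try {
    return await fn();
  } finally {
    // Record the time even when fn throws, so failing calls are counted too.
    const elapsed = performance.now() - start;
    timings.set(label, (timings.get(label) || 0) + elapsed);
  }
}

// Assumed usage inside the gatherer (not run here):
//   const matchedRules = await timed('CSS.getMatchedStylesForNode', () =>
//     driver.sendCommand('CSS.getMatchedStylesForNode', {nodeId}));
//   ...
//   console.log(Object.fromEntries(timings));
```

Summing per label rather than logging each call keeps the output readable on a page with ~100 images, and makes it easy to see which of the three areas dominates.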
Either:
- isExpicitlySized (no need to get the actual size, audit doesn’t care): ImageElements would have isExpicitlySized set to true iff the snapshot for that element has a value for width and height.
After discussing with @lemcardenas, we’re going to revert #11217 and #11188 before we ship 6.3.0.