tt-metal: Falcon40b prefill 4 chip hang with 60 layers
This commit (or the previous one, seems they have been merged together) broke the prefill. Commit before that runs 60 layers without issues.
A couple of layers run normally. Frequency is at 800mhz
To reproduce you can run on main:
pytest models/demos/falcon40b/tests/test_falcon_end_to_end.py -k "prefill and layers_60 and BFLOAT8" --timeout 1000
However, you would need to have all Falcon40b weights downloaded and prepared and this takes awhile, so we can also sync offline to see how to debug this together since I have the machine set up.
About this issue
- Original URL
- State: closed
- Created 4 months ago
- Comments: 16 (16 by maintainers)
Commits related to this issue
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation (cherry picked from commit b7dff0e1da5c4f3c7391584a06804381acca2146) — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #6289: fix dispatcher page calculation — committed to tenstorrent/tt-metal by aliuTT 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
- #2371: Add support to run Fast Dispatch on Idle Ethernet Cores. #2371: Enable DPRINT on idle erisc cores #2371: remove mmio chan 2,3 from idle eth cores #2371: remove dispatch channels that have li... — committed to tenstorrent/tt-metal by ubcheema 4 months ago
Great, I’ll close the issue then. The commit is in main now.
Verified - both the decode demo and prefill 60 layers pass on main + cherry-pick!