-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build cache ERROR: failed to solve: Internal: not found #5784
Comments
Might be related to Github moving to V2 version of their API https://github.blog/changelog/2024-09-16-notice-of-upcoming-deprecations-and-changes-in-github-actions-services/ |
I did see some other mentions of upcoming cache changes. The fact that it generally does not happen on retry made me think it wasn't that. The brownout dates are certainly interesting, but it happened this morning which is not in the listed dates. |
We already have v2 implementation in BuildKit v0.20.0 . Afaics, most repositories are already using v2 when runners start, although at least initially only some repositories had support. The deadlines are for shutting down v1. |
If you have a link for your Github actions run then you can post it for more details. |
This is on a private repository unfortunately, can't link. |
@bhperry - We'd be very grateful if you would reach out to the GitHub support team, open a ticket and drop the link to your run in the private repository. We'd like to take a look to understand this behaviour. Feel free to mention this comment as well and ask the team to escalate to engineering. |
@Link- ticket submitted |
@bhperry Does this command runs within a Also does it change smth if you remove these attributes: Can you show your workflow as well please? And what is the output of |
@crazy-max Yes it does run within a run block. I have a local action exporting I'm going to try to make a simpler reproduction of this, because there's a lot of scripts internal to my organization obfuscating what's actually happening in the workflow. |
Also, found this in the
|
@crazy-max Here is the output of
I noticed that ghaction-github-runtime ends up exporting some extra envs including Unfortunately have not yet been able to repro on a different workflow. With it being a non-deterministic failure it's difficult to know for sure if it won't happen or just hasn't yet. |
Any ideas on what to look at to figure out what could be causing this? Looking at the line that is throwing the error, it seems to be more related to internal cache storage than the GHA cache. Not familiar enough with the buildkit codebase to understand why the bucket could be missing. buildkit/solver/bboltcachestorage/storage.go Lines 122 to 131 in e9af0f6
|
@bhperry This link is for the default local cache using the bolt database. Remote backends like GHA do not go through it. |
@tonistiigi That is what is pointed to in the stack trace I posted above #5784 (comment) (which I see in the buildx container logs every time this error happens). I thought maybe the cache data is stored locally in bbolt during build and then uploaded to gha. Is it a red herring? |
Based on the stacktrace, I think we should consider this unrelated to GHA changes atm. Still needs a reproduction for further debug. |
We hit this issue and had to roll back to buildkitd v0.19. Some images consistently fail to push with this message. |
@aaronlehmann Can you bisect? Are you using GHA? |
Bisecting wouldn't be easy, it takes several steps and ~1 hour to deploy this service. We are not using GHA. |
Failing to push sounds like it may be different issue than what I'm seeing. My builds push successfully but fail on the cache step. Seems to be a pretty generic error that gets used in many places. |
Contributing guidelines and issue reporting guide
Well-formed report checklist
Description of bug
Bug description
Lately I have started to see docker builds in Github Actions that use the GHA build cache fail regularly when exporting the cache at the end of a build. Re-running the build will often cause it to succeed, but we get the failures almost every day now.
Have been running with the GHA cache for a while now with no issues.
Reproduction
Happens with a variety of different image builds, some of which are very thin wrappers on existing images. So any basic Dockerfile should work.
We run the builds from within a github action, automated by python scripts. But the end result is to run commands like this:
Version information
ubuntu-20.04 github actions runner
Docker-Buildx 0.21.1
Docker Client 26.1.3
The text was updated successfully, but these errors were encountered: