azure-search-openai-demo: Error deploying open AI: Specified capacity of account deployment is bigger than available capacity for UsageName Tokens Per Minute (Thousands) ext-Davinci-003

Please provide us with the following information:

This issue is for a: (mark with an x)

- [X ] bug report -> please search issues before submitting
- [ ] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior that used to work and stopped in a new release)

Minimal steps to reproduce

I ran azd up after initializing the project, however the openai deployment failed and I got the error message described. The other components deployed without errors. I’m not sure exactly what’s causing the error so I’m not sure how to reproduce it.

Any log messages given by the failure

The specified capacity ‘60’ of account deployment is bigger than available capacity ‘0’ for UsageName ‘Tokens Per Minute (thousands) - Text-Davinci-003’.

Expected/desired behavior

Expected behavior is for openai to be deployed without errors.

OS and Version?

Windows 10 Enterprise (OS Build 19044.2965)

azd version?

azd version 1.0.1 (commit e0cd1aca716fa5d08704beade7dcc734fe68f5f1)

Mention any other details that might be useful

About this issue

  • Original URL
  • State: closed
  • Created a year ago
  • Comments: 36

Most upvoted comments

I ran into this as well. It turns out that Azure OpenAI changed their quota system last week, and hasn’t yet updated the Bicep schema (the infrastructure-as-code language used for this repo and others) to reflect the new quota system. For now, I’ve been manually resetting the quota of my deployments before each “azd up” by going to the Azure OpenAI studio, selecting the Quotas tab, and sliding the TPM to 1K for each of them.

open ai service is “soft deleted” if you have one or a few you’ve deleted you may need to purge them to release quota

@OrionSehn Please subscribe to this issue in the azd repo, as that seems to be what you’re looking for: https://github.com/Azure/azure-dev/issues/337 Thanks for the feedback!

I’m completely new to Azure.

How do you manually reset the quota of your deployment, when you havent deployed it? In my Azure OpenAI studio, there is no deployment visible, before and after deploying this demo with “azd up”.

I only see the deployment in portal.azure.com.

As I already mentioned, I’m new to Azure / Azure OpenAI and I really don’t understand where to look. I can set an environment-name for the deployment, which creates a ressource-group, but that’s all I understand. I can also see the quota TPMs per region and subscription, which currently sits at 120/120. Since I have no deployment running in East US and it still sits at 120/120, I wonder how I can determine which deployment currently takes up all quotas? For West Europe I can see the deployments, taking up the TPMs, but for East US I can’t. Can it be that my account doesn’t have sufficent permissions? My company created it for tests.

@fakoe, temporary solution is to go to Azure AI studio, Deployments, and for each Edit deployment, Advanced, set “Tokens per Minute Rate Limit” from 120 to e.g. 2. Then deploy your accelerator with “azd up”. Also open the project it in VSCode, it will suggest you cool extensions

If anyone is still experiencing this and think its an error, here is a related issue: https://github.com/Azure/bicep-types-az/issues/1660

MicrosoftTeams-image

follow this to resolve the issue

the issue is still here. But purging fixes the issue. How to purge , see this : https://learn.microsoft.com/en-us/azure/cognitive-services/manage-resources?tabs=azure-portal

It looks like https://github.com/Azure-Samples/azure-search-openai-demo/pull/322 might address this, I’ll be testing it out shortly.

I’m completely new to Azure.

How do you manually reset the quota of your deployment, when you havent deployed it? In my Azure OpenAI studio, there is no deployment visible, before and after deploying this demo with “azd up”.

I only see the deployment in portal.azure.com.

As I already mentioned, I’m new to Azure / Azure OpenAI and I really don’t understand where to look. I can set an environment-name for the deployment, which creates a ressource-group, but that’s all I understand. I can also see the quota TPMs per region and subscription, which currently sits at 120/120. Since I have no deployment running in East US and it still sits at 120/120, I wonder how I can determine which deployment currently takes up all quotas? For West Europe I can see the deployments, taking up the TPMs, but for East US I can’t. Can it be that my account doesn’t have sufficent permissions? My company created it for tests.