Increase apiserver resource requests and limits #2581

jkbschmid · 2019-03-13T15:05:08Z

This PR is a

Bug Fix

What this PR does / why we need it:
We are using the service catalog with a catalog with two brokers and a catalog comprising a total of ~150 plans and ~5 services. With the default configuration, the apiserver is forcibly restarted every 30 mins by k8s due to an OOM error. This happens because the apiserver requires around ~35MB, but only has a resource memory limit of 30.
This PR increases the default memory resource request from 20 to 40 and the memory resource limit from 30 to 50 MB.

Merge Checklist:

New feature
- Tests
- Documentation
SVCat CLI flag
Server Flag for config
- Chart changes
- removing a flag by marking deprecated and hiding to avoid
  breaking the chart release and existing clients who provide a
  flag that will get an error when they try to update

k8s-ci-robot · 2019-03-13T15:05:13Z

Thanks for your pull request. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please follow instructions at https://git.k8s.io/community/CLA.md#the-contributor-license-agreement to sign the CLA.

It may take a couple minutes for the CLA signature to be fully registered; after that, please reply here with a new comment and we'll verify. Thanks.

If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check your existing CLA data and verify that your email is set on your git commits.
If you signed the CLA as a corporation, please sign in with your organization's credentials at https://identity.linuxfoundation.org/projects/cncf to be authorized.
If you have done the above and are still having issues with the CLA being reported as unsigned, please email the CNCF helpdesk: [email protected]

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

k8s-ci-robot · 2019-03-13T15:05:23Z

Hi @jkbschmid. Thanks for your PR.

I'm waiting for a kubernetes-incubator or kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

jberkhahn · 2019-03-13T16:26:45Z

/ok-to-test

jboyd01 · 2019-03-13T17:25:13Z

/lgtm

jberkhahn · 2019-03-13T17:57:41Z

@jboyd01
Do you think this is a good idea? I'm not saying it isn't, but these values are already configurable in the chart, what makes the new ones any better? I imagine they'll still cause problems with enough stuff in the backend.

jimmidyson · 2019-03-13T18:08:07Z

Unless I've missed it, I think that documentation could be clearer to show how to set and update resource configurations, along with some example scenarios.

jberkhahn · 2019-03-13T18:29:26Z

Documentation on how to configure helm charts seems to be a bit beyond the scope of our project. Or do you mean a list of the possible things you can set in our helm chart?

jboyd01 · 2019-03-13T18:55:46Z

I'm sure the doc could be clearer, but I also think the limits are pretty tight. I'm all for bumping them up a bit by default.

c0d1ngm0nk3y · 2019-03-13T19:45:41Z

@jboyd01
Do you think this is a good idea? I'm not saying it isn't, but these values are already configurable in the chart, what makes the new ones any better? I imagine they'll still cause problems with enough stuff in the backend.

Do you expect much problems? We are talking about 10 MB we ask the scheduler in addition, right? Yes, it is configurable, but this exactly what you get per default if you just call helm install. I think it would be better to work out of the box for simple scenarios.

We installed the service catalog in our development landscape and it ended up to restart every 30mins. The problem was that it lost all ServiceClusterBroker in the process. In our case, those defaults would have saved us quite some time.

But I understand that it is hard/impossible to find the perfect default.

jkbschmid · 2019-03-13T20:17:23Z

Thanks for the quick replies!
Once we had identified the resource limits as the source of our issue, we found the option in the helm charts right away. So, in our case, the documentation was not an issue.
The goal of this PR was just to adjust the defaults so that other folks that try out the service catalog don't stumble over the same issue.:)

jberkhahn · 2019-03-13T20:27:52Z

/approve
my concern is really how do we know this is high enough to be a better default - maybe it should be even higher? we can try this for now, i guess

k8s-ci-robot · 2019-03-13T20:30:35Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: jberkhahn

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [jberkhahn]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

jberkhahn · 2019-03-13T20:37:43Z

weird, the travis tests passed but the git robot isn't pulling the status for some reason.
/retest

jboyd01 · 2019-03-13T21:06:17Z

@jkbschmid as I reread this and discussed with @jberkhahn I feel obligated to reply with a clarification / elaboration. Any real use of service catalog probably deserves a more advanced deployment & configuration of etcd. As you have seen, if your Service Catalog API Server container is restarted for any reason you are going to loose your etcd storage. For anything other then "play" I'd encourage the setup of a HA etcd deployment with persistent storage and then configuring the Service Catalog API Server to use it. You may have already deduced this, but I wanted to be sure it was understood and I'll try to revisit the doc we have around that.

)

Increase apiserver resource requests and limits

b0db828

k8s-ci-robot added cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. labels Mar 13, 2019

k8s-ci-robot added the needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. label Mar 13, 2019

k8s-ci-robot requested review from jberkhahn and jboyd01 March 13, 2019 15:05

k8s-ci-robot assigned jboyd01 Mar 13, 2019

k8s-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Mar 13, 2019

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Mar 13, 2019

k8s-ci-robot merged commit c000c23 into kubernetes-retired:master Mar 13, 2019

viviyww pushed a commit to viviyww/service-catalog that referenced this pull request May 10, 2019

Increase apiserver resource requests and limits (kubernetes-retired#2581

6c3ed4b

)

cblecker unassigned jboyd01 Jun 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Increase apiserver resource requests and limits #2581

Increase apiserver resource requests and limits #2581

jkbschmid commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jimmidyson commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

c0d1ngm0nk3y commented Mar 13, 2019 •

edited

Loading

jkbschmid commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

Increase apiserver resource requests and limits #2581

Increase apiserver resource requests and limits #2581

Conversation

jkbschmid commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jimmidyson commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

c0d1ngm0nk3y commented Mar 13, 2019 • edited Loading

jkbschmid commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

k8s-ci-robot commented Mar 13, 2019

jberkhahn commented Mar 13, 2019

jboyd01 commented Mar 13, 2019

c0d1ngm0nk3y commented Mar 13, 2019 •

edited

Loading