Tune-up salt alive configs for heavy load
* These settings are expected to increase deployment time, but they reduce
false-positive rejects from the master.
auth_timeout: 180
When waiting for a master to accept the minion's public key, salt will
continuously attempt to reconnect until successful. This is the timeout value,
in seconds, for each individual attempt. After this timeout expires, the minion
will wait for acceptance_wait_time seconds before trying again. Unless your master
is under unusually heavy load, this should be left at the default.
- In our case, this covers the issue with overloaded master zmq threads.
master_tries: -1
The number of attempts to connect to a master before giving up.
Set this to -1 for unlimited attempts. This allows for a master to have
downtime and the minion to reconnect to it later when it comes back up.
auth_tries: 10
Number of consecutive SaltReqTimeoutError that are acceptable when trying to
authenticate.
gather_job_timeout: 20
The number of seconds to wait when the client is requesting information
about running jobs.
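For reference, a minimal sketch of how the three minion-side options above
could look in a plain Salt minion config. The file path is an assumption
(the usual default location), and the values are simply the ones listed in
this message. gather_job_timeout is a master-side option and is left out.

```yaml
# /etc/salt/minion  (assumed path; adjust to your deployment)

# Seconds to wait on each authentication attempt before timing out.
auth_timeout: 180

# -1 means keep trying to connect to the master indefinitely.
master_tries: -1

# Consecutive SaltReqTimeoutError occurrences tolerated during auth.
auth_tries: 10
```

With these values a minion keeps retrying through master downtime rather
than giving up, at the cost of slower failure detection.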
Prod-related: PROD-27850 (PROD:27850)
Change-Id: I9095f0847acb6dd8b5822afb2ed4b7fdd30a3274
(cherry picked from commit d71c329937f2b9de5c2454886af2735f0c83292e)
diff --git a/salt/master/single.yml b/salt/master/single.yml
index 25eaf56..8d36565 100644
--- a/salt/master/single.yml
+++ b/salt/master/single.yml
@@ -15,6 +15,8 @@
worker_threads: 40
max_open_files: 15000
command_timeout: 10
+ opts:
+ gather_job_timeout: 40
peer:
'.*':
- x509.sign_remote_certificate