Mateusz Matuszkowiak [Mon, 23 Jul 2018 10:46:34 +0000 (12:46 +0200)]
Fixed WaterMark alerts regarding disk space usage
Change-Id: I4211048cc6c06fc041a03c35624af45ea0817f31
Closes-Bug: PROD-21584
Mateusz Matuszkowiak [Thu, 21 Jun 2018 09:36:29 +0000 (11:36 +0200)]
Updated Elasticsearch dashboard to align with recent rules
Change-Id: Ibddf450a61ba8e253064ab283e8f852e6a6d0fe5
Related-Bug: PROD-20879
Mateusz Matuszkowiak [Thu, 14 Jun 2018 13:18:55 +0000 (15:18 +0200)]
Dashboard rationalization for Elasticsearch
Change-Id: Ib48467e7b9f03bb92c1e0ad7820e09946a28311c
Closes-Bug: PROD-20077
Ondrej Smola [Tue, 12 Jun 2018 15:09:55 +0000 (15:09 +0000)]
Merge "Replace Salt elasticsearch_index_template state deprecated since 2017.7"
Martin Polreich [Tue, 12 Jun 2018 12:37:44 +0000 (14:37 +0200)]
Replace Salt elasticsearch_index_template state deprecated since 2017.7
Change-Id: I2a7f608e8bb4745a99e52887c81fe3611c34f739
Michal Kobus [Thu, 7 Jun 2018 12:16:34 +0000 (14:16 +0200)]
Cosmetic changes for alerts
Change-Id: Ie84879621a19f04aa8210b4d08511a29888467c0
Closes-bug: PROD-20466
Mateusz Matuszkowiak [Fri, 11 May 2018 12:19:09 +0000 (14:19 +0200)]
Alerts rationalization for Elasticsearch
Change-Id: I1ab91bb31e622dd2cfafa6c6f16895700727d286
Closes-Bug: PROD-19796
Petr Michalec [Thu, 29 Mar 2018 06:25:57 +0000 (08:25 +0200)]
Update metadata.yml with correct URL
Change-Id: Ic07261d09ce61700a4bf9b4688436cfb8e9dfaad
mcp-jenkins [Fri, 2 Mar 2018 16:33:04 +0000 (16:33 +0000)]
Merge "Fix wrong package name for curator"
Mateusz Matuszkowiak [Wed, 28 Feb 2018 11:22:35 +0000 (12:22 +0100)]
Gzip rolling logs for ES 5.x
Change-Id: If7515a14a61be54e1bdb2775e53c353d5d6b6779
Closes-Bug: PROD-18146
Dmitry Kalashnik [Tue, 20 Feb 2018 15:17:55 +0000 (19:17 +0400)]
Add state for updating Elasticsearch index templates
To prevent index template from constant or accidential update
add separate salt state to explicit use in order to update.
To use run:
salt -C 'I@elasticsearch:client' state.sls elasticsearch.client.update_index_templates
Change-Id: I19aca3c6901f79ac22457a30aada0904057de5cc
Closes-Bug: PROD-17954
vgusev [Mon, 12 Feb 2018 15:10:34 +0000 (19:10 +0400)]
Fix wrong package name for curator
Related-Prod: PROD-17706
Change-Id: Idfd3200a855896a40bfad11eeabc4c764f6cdc6f
Petr Michalec [Fri, 16 Feb 2018 15:14:49 +0000 (15:14 +0000)]
Merge "Use new image for kitchen tests"
Martin Polreich [Fri, 16 Feb 2018 13:54:21 +0000 (14:54 +0100)]
Use new image for kitchen tests
Change-Id: I84a3bab57955fbfd794f9aa123d3fcbf44581905
Filip Pytloun [Thu, 15 Feb 2018 14:13:12 +0000 (15:13 +0100)]
Update Makefile
Jakub Josef [Thu, 8 Feb 2018 12:22:14 +0000 (13:22 +0100)]
Fixed shebang kernel limitation with virtualenv during test run (https://github.com/pypa/virtualenv/issues/596)
Change-Id: Iedbd1ee502588f6454c660b3ffa9209d90a34bf5
Filip Pytloun [Mon, 15 Jan 2018 12:14:44 +0000 (12:14 +0000)]
Merge "Remove salt-master, reclass from pkg dependencis"
Petr Michalec [Wed, 10 Jan 2018 09:36:23 +0000 (09:36 +0000)]
Merge "Optimize kitchen tests for Travis CI"
Petr Michalec [Tue, 9 Jan 2018 08:10:38 +0000 (09:10 +0100)]
Remove salt-master, reclass from pkg dependencis
Change-Id: I310e685e7614ba4eecce8e4d7ea50a1a5fe0c89c
Martin Polreich [Fri, 5 Jan 2018 12:04:24 +0000 (13:04 +0100)]
Optimize kitchen tests for Travis CI
Change-Id: I93dee77ae8fe1263f2b4983cc5cf9e715e5848b5
Jakub Josef [Wed, 27 Dec 2017 17:58:46 +0000 (17:58 +0000)]
Merge "provide index creation without date-mapping"
mcp-jenkins [Thu, 21 Dec 2017 12:01:11 +0000 (12:01 +0000)]
Merge "Switch Elasticsearch to 5 version"
Vnaumov [Wed, 20 Dec 2017 12:32:27 +0000 (15:32 +0300)]
provide index creation without date-mapping
force creating allow to create date-marked indicies
its useful by defaul and also provide issues due incorrect es-salt-formula actions
see related bug and other refs in this topic
bug: https://mirantis.jira.com/browse/PROD-16661
Change-Id: I8ca7d4358b5ac1935989512cd197cec3155662f5
Jakub Josef [Fri, 15 Dec 2017 15:58:29 +0000 (16:58 +0100)]
Fixed typo in run_tests.sh file
Change-Id: I49f2d2c39214b4df31a6e8c6c110cd9e5dd724ca
vgusev [Wed, 6 Dec 2017 08:20:36 +0000 (12:20 +0400)]
Switch Elasticsearch to 5 version
Change-Id: Ic5f0937b6288b181a6b8708a85173ad15eb4d65c
Ildar Svetlov [Tue, 5 Dec 2017 10:08:42 +0000 (14:08 +0400)]
Fix bugs in ES Grafana dashboard
Change-Id: I6aeb8ebe3ccf1ad6e753339a98efaa9811c92757
Bartosz Kupidura [Wed, 29 Nov 2017 14:46:39 +0000 (14:46 +0000)]
Merge "Move fluentd under agent role"
mcp-jenkins [Wed, 22 Nov 2017 21:45:33 +0000 (21:45 +0000)]
Merge "Add a possibility to deploy Elasticsearch 5.x version"
Bartosz Kupidura [Wed, 22 Nov 2017 13:50:50 +0000 (14:50 +0100)]
Move fluentd under agent role
Change-Id: I4060dca9e30d01e0b850c11faf7d06aa768eea78
vgusev [Wed, 8 Nov 2017 09:21:39 +0000 (13:21 +0400)]
Add a possibility to deploy Elasticsearch 5.x version
Elasticsearch 2.x version is deployed by default
Change-Id: I78e714807cf5925a9619961ab145f27ef0210652
Ildar Svetlov [Wed, 8 Nov 2017 22:31:47 +0000 (02:31 +0400)]
Add fluentd support to ES
Change-Id: I2a0e7b4f6cdfdb3fda51ed1983e68b173b600441
Martin Polreich [Wed, 18 Oct 2017 11:24:06 +0000 (11:24 +0000)]
Merge "Add prometheus main dashboard"
Martin Polreich [Fri, 13 Oct 2017 09:05:25 +0000 (11:05 +0200)]
Add prometheus main dashboard
Change-Id: I27e43436e2d529c49992504da7d55b326e5c6b24
Dmitry Kalashnik [Fri, 13 Oct 2017 09:58:35 +0000 (13:58 +0400)]
Add threshold to ElasticSearch alarms
Change-Id: Id25f13cad6e5646a9cdc8f21fb8af86533ae6f7f
Partial-Bug: PROD-15203
Olivier Bourdon [Tue, 3 Oct 2017 06:35:49 +0000 (08:35 +0200)]
Fix typo for proper config syntax
Change-Id: I34ffea1ec635e781f846312a3280fb609f131766
Simon Pasquier [Mon, 4 Sep 2017 08:59:06 +0000 (10:59 +0200)]
Sort hosts by alphabetical order in Grafana
Change-Id: I15625e74d1ae042467e684c852cd73d679d8c278
Martin Polreich [Fri, 1 Sep 2017 09:37:05 +0000 (11:37 +0200)]
Fix name in .kitchen.travis.yml file
Change-Id: I88ed86698995136c1b04d90e80d9b236a1d5c8b9
Martin Polreich [Tue, 25 Jul 2017 11:48:37 +0000 (11:48 +0000)]
Merge "Move suites definition from .travis.yml to .kitchen.yml"
Martin Polreich [Mon, 17 Jul 2017 09:06:45 +0000 (11:06 +0200)]
Move suites definition from .travis.yml to .kitchen.yml
Change-Id: I3b0ed2e5996e6e5027d7c4e836df97874e5c45f9
Simon Pasquier [Thu, 13 Jul 2017 09:14:18 +0000 (11:14 +0200)]
Split between local and remote Telegraf agents
The remote agent is now in charge of collecting the cluster health
metrics. This makes the alerting rules on cluster health more robust
because they don't have to aggregate metrics anumore.
Change-Id: I1dcd1801038e5e580402a70fb9325f39fac02c85
Depends-On: I473a916178ec4f9ea44951975f55fff5ee68e687
Dmitry Kalashnik [Tue, 11 Jul 2017 11:14:08 +0000 (15:14 +0400)]
Change min to max as far as the largest metric value is the worst
Change-Id: Ib37aeada66519006b7b5a817625bda416b93ac57
Dennis Dmitriev [Thu, 6 Jul 2017 18:07:27 +0000 (21:07 +0300)]
Update run_tests.sh to the latest revision
Change-Id: Ib30706f305a8f8bd7d075ed1cee04e8e0d6ab383
Swann Croiset [Wed, 14 Jun 2017 08:52:49 +0000 (10:52 +0200)]
Fix Grafana dashboard
This displays number of documents per host
Change-Id: I5db9abcaf77f3fce2472e1944e220237ab678908
Filip Pytloun [Tue, 13 Jun 2017 08:41:04 +0000 (10:41 +0200)]
Merge pull request #32 from Martin819/master
Optimize 'noservices' condition
Martin Polreich [Thu, 8 Jun 2017 13:16:16 +0000 (15:16 +0200)]
Optimize 'noservices' condition
Filip Pytloun [Tue, 6 Jun 2017 14:48:20 +0000 (16:48 +0200)]
Merge pull request #31 from corest/create_index
Extend template definition options
Volodymyr Stoiko [Tue, 6 Jun 2017 06:56:56 +0000 (09:56 +0300)]
Extend template definition options
This change adds:
- support for template definition from the metadata
- forcing index creation from the defined template
Swann Croiset [Fri, 26 May 2017 15:49:32 +0000 (17:49 +0200)]
Configure Prometheus alerts for Elasticsearch
2 kinds of alers:
- cluster health status
- disk allocation watermark (per node)
Change-Id: I0d168ba899d8a168543c8ba0a63793001e46267b
Swann Croiset [Wed, 24 May 2017 08:31:27 +0000 (10:31 +0200)]
Add Grafana dashboard with prometheus datasource
Change-Id: Id7852fae234ee6a13cbb657da6a76ef19095baf7
Swann Croiset [Mon, 22 May 2017 08:38:37 +0000 (10:38 +0200)]
Add Telegraf support
Change-Id: I555974d19c7b8cb48f2334551a202f1cc0885f72
Tomáš Kukrál [Thu, 11 May 2017 19:58:05 +0000 (21:58 +0200)]
fix meta/salt with missing pillar
Change-Id: I32018e1b1701530ae0369750d54779245f77f693
Martin Polreich [Wed, 10 May 2017 09:48:29 +0000 (09:48 +0000)]
Merge "Manage minion.d using support metadata"
Simon Pasquier [Wed, 3 May 2017 14:39:34 +0000 (16:39 +0200)]
Install JRE headless package
Because the elasticsearch package from Elastic doesn't have a
dependency on it.
Change-Id: I416f5e7760652899f8368763a5315729fd9cccde
Filip Pytloun [Wed, 3 May 2017 13:21:03 +0000 (15:21 +0200)]
Manage minion.d using support metadata
Change-Id: Iea239435bde1572e072c97d1376eae31a8e3a010
Petr Michalec [Tue, 2 May 2017 09:39:36 +0000 (11:39 +0200)]
Merge pull request #28 from Martin819/master
Added tests script and sample pillars, Kitchen tests and Travis
Petr Michalec [Tue, 2 May 2017 09:02:38 +0000 (11:02 +0200)]
Apply review comments
Martin Polreich [Tue, 11 Apr 2017 11:27:30 +0000 (13:27 +0200)]
Added tests script and sample pillars, Kitchen tests and Travis
Swann Croiset [Thu, 30 Mar 2017 09:42:23 +0000 (11:42 +0200)]
Update Grafana dashboard with performance metrics
Change-Id: Ibc070315001ff47ec667aa71b1aea692f48295db
Simon Pasquier [Tue, 21 Mar 2017 13:26:14 +0000 (14:26 +0100)]
Fix typo in metadata/service/client.yml
Change-Id: I068e0f74bea0e74ad18026fa7721119a93c915ee
Simon Pasquier [Mon, 13 Mar 2017 09:35:09 +0000 (10:35 +0100)]
Add 'region' field to the index templates
Change-Id: I70bcf8f708d005384bec2988caf4c96f81a609ea
Swann Croiset [Wed, 22 Feb 2017 16:56:03 +0000 (17:56 +0100)]
Fix index template for log messages
Change-Id: I7c00d4acf999709551076a918bb1956d1ce4b973
Simon Pasquier [Thu, 9 Feb 2017 15:31:03 +0000 (16:31 +0100)]
Fix typo in notification index template
Change-Id: If405274c1ce97f1446247622278cc86780362f5d
Filip Pytloun [Thu, 2 Feb 2017 12:02:03 +0000 (13:02 +0100)]
Unify Makefile, .gitignore and update readme
Simon Pasquier [Fri, 27 Jan 2017 12:39:43 +0000 (13:39 +0100)]
Fix index template for notification messages
Change-Id: I6ec81d41d52427ab68484e5482e3c14388b58963
Simon Pasquier [Fri, 27 Jan 2017 12:15:53 +0000 (13:15 +0100)]
Add index template for CADF audit messages
Change-Id: Ia4289468a6022e0bfeea166d807385aa170d3888
Guillaume [Thu, 12 Jan 2017 14:38:39 +0000 (15:38 +0100)]
Merge pull request #27 from thouveng/fix-grafana-template
Fix templating to get the name of nodes
Guillaume Thouvenin [Mon, 9 Jan 2017 12:41:48 +0000 (13:41 +0100)]
Fix templating to get the name of nodes
This patch modifies the query used to get the name of Elasticsearch
instances.
pasquier-s [Tue, 27 Dec 2016 16:01:00 +0000 (17:01 +0100)]
Merge pull request #26 from ityaptin/master
Enable script supporting in request
Ilya Tyaptin [Wed, 21 Dec 2016 10:02:25 +0000 (14:02 +0400)]
Enable script supporting in request
This feature is needed for resources updating in Telemetry
pasquier-s [Thu, 8 Dec 2016 14:25:57 +0000 (15:25 +0100)]
Merge pull request #25 from elemoine/availability-of-members
Use the availability_of_members policy
Éric Lemoine [Wed, 7 Dec 2016 13:06:27 +0000 (13:06 +0000)]
Use the availability_of_members policy
Use the availability_of_members policy for the elasticsearch_service
alarm cluster definition.
pasquier-s [Tue, 6 Dec 2016 08:37:15 +0000 (09:37 +0100)]
Merge pull request #24 from simonpasquier/fix-cluster-policy
Change policy for the elasticsearch_service cluster
thouveng [Mon, 5 Dec 2016 09:38:27 +0000 (10:38 +0100)]
Merge pull request #23 from thouveng/seperate-config-and-objects
Separate the configuration and other part fo the client
Simon Pasquier [Fri, 2 Dec 2016 10:51:17 +0000 (11:51 +0100)]
Change policy for the elasticsearch_service cluster
This change updates the policy to availability_of_members to trigger an
alert whenever a node is down.
pasquier-s [Fri, 2 Dec 2016 10:10:38 +0000 (11:10 +0100)]
Merge pull request #22 from simonpasquier/extend-curator-config
Allow to configure master_only for the curator
Guillaume Thouvenin [Fri, 2 Dec 2016 09:32:57 +0000 (10:32 +0100)]
Separate the configuration and other part of the client
This patch separates the configuration of the client and the creation of
the Kibana objects into Elasticsearch. Now we need to call the state
client.service to configure the client, restart the service and then
apply the client state.
Simon Pasquier [Fri, 2 Dec 2016 08:56:08 +0000 (09:56 +0100)]
Allow to configure master_only for the curator
pasquier-s [Wed, 30 Nov 2016 15:34:40 +0000 (16:34 +0100)]
Merge pull request #21 from simonpasquier/fix-elastic-alarms
Evaluate cluster alarms on the remote_collector
Simon Pasquier [Mon, 28 Nov 2016 16:39:25 +0000 (17:39 +0100)]
Evaluate cluster alarms on the remote_collector
This change also splits the collectd support metadata to collect node
metrics from the local node(s) and cluster metrics from the remote
collector node.
Filip Pytloun [Fri, 25 Nov 2016 16:32:50 +0000 (17:32 +0100)]
Merge pull request #20 from tcpcloud/stacklight
Stacklight
thouveng [Fri, 25 Nov 2016 10:34:19 +0000 (11:34 +0100)]
Merge pull request #18 from thouveng/use-new-grafana-convention
Put Grafana dashboards into their own directory
pasquier-s [Fri, 25 Nov 2016 08:02:02 +0000 (09:02 +0100)]
Merge pull request #19 from simonpasquier/stacklight
Merge master into stacklight
Simon Pasquier [Thu, 24 Nov 2016 16:09:12 +0000 (17:09 +0100)]
Merge remote-tracking branch 'upstream/master' into stacklight
Guillaume Thouvenin [Thu, 24 Nov 2016 15:44:17 +0000 (16:44 +0100)]
Put Grafana dashboards into their own directory
Éric Lemoine [Thu, 24 Nov 2016 15:15:16 +0000 (16:15 +0100)]
Merge pull request #16 from elemoine/stacklight-dashboard
Use better InfluxDB queries
Éric Lemoine [Thu, 24 Nov 2016 15:14:51 +0000 (16:14 +0100)]
Merge pull request #17 from SwannCroiset/nagios-host
Add nagios_host dimension
Swann Croiset [Thu, 24 Nov 2016 15:13:31 +0000 (16:13 +0100)]
Add nagios_host dimension
Éric Lemoine [Thu, 24 Nov 2016 14:01:32 +0000 (14:01 +0000)]
Use better InfluxDB queries
Use better InfluxDB queries by not using GROUP_BY when not necessary.
Éric Lemoine [Thu, 24 Nov 2016 12:59:35 +0000 (13:59 +0100)]
Merge pull request #14 from elemoine/stacklight-alarm
Add alarms and alarm clusters
Filip Pytloun [Wed, 23 Nov 2016 16:51:26 +0000 (17:51 +0100)]
Merge pull request #13 from thouveng/install_python_client
Salt module elasticsearch depends on elasticsearch-py
Guillaume Thouvenin [Wed, 23 Nov 2016 08:38:22 +0000 (09:38 +0100)]
Salt module elasticsearch depends on elasticsearch-py
This patch installs the python client.
Éric Lemoine [Wed, 23 Nov 2016 16:04:58 +0000 (17:04 +0100)]
Merge pull request #15 from elemoine/stacklight-dashboard
Remove fs-related panels in Grafana
Éric Lemoine [Wed, 23 Nov 2016 15:17:29 +0000 (15:17 +0000)]
Remove fs-related panels in Grafana
Éric Lemoine [Wed, 23 Nov 2016 14:55:10 +0000 (14:55 +0000)]
Add alarms and alarm clusters
thouveng [Mon, 21 Nov 2016 09:07:11 +0000 (10:07 +0100)]
Merge pull request #10 from thouveng/add-index-template
Add Elasticsearch index templates
Guillaume Thouvenin [Thu, 17 Nov 2016 14:59:28 +0000 (15:59 +0100)]
Add Elasticsearch index templates
This patch adds three index templates that are used by StackLight.
Éric Lemoine [Fri, 18 Nov 2016 10:57:11 +0000 (11:57 +0100)]
Merge pull request #12 from thouveng/add-missing-json
Add missing JSON format for dashboard
Guillaume Thouvenin [Fri, 18 Nov 2016 09:27:56 +0000 (10:27 +0100)]
Add missing JSON format for dashboard
Filip Pytloun [Fri, 18 Nov 2016 08:46:13 +0000 (09:46 +0100)]
Merge pull request #11 from thouveng/keep-default-value-for-index
Use default value for shards and replicas
Guillaume Thouvenin [Thu, 17 Nov 2016 15:47:57 +0000 (16:47 +0100)]
Use default value for shards and replicas
This patch removes the override of the shards. This will be set
in the model because it depends of your infrastructure.