2018-01-31 22:52:50 +00:00
# HELK [Alpha]
2018-06-12 05:28:26 +00:00
v0.1.6-alpha12132018
HELK base image
+ Updated to 0.0.3
HELK ELK Version
+ Now using 6.5.3 official ELK Docker Images (https://www.elastic.co/blog/elastic-stack-6-5-3-released)
helk_install
+ Users can now select between two deployments:
++ helk-kibana-analysis (KAFKA + KSQL + ELK + NGNIX + ELASTALERT)
++ helk-kibana-notebooks (KAFKA + KSQL + ELK + NGNIX + ELASTALERT + SPARK + JUPYTER)
+ Fixed https://github.com/Cyb3rWard0g/HELK/issues/131 . Users can now set up the Kibana UI User password during installation. Also, user can set the Elasticsearch elastic account password when using the Trial license option.
helk-elastalert
+ Elastalert deployed and ready to use with SIGMA integration. Blog available at https://medium.com/@Cyb3rWard0g
helk-elasticsearch
+ consolidated main configs in one
+ added more environment variables for ELASTIC_PASSWORD and default values in case it is not used to be compatible with the default values applied to HELK.
helk-logstash
+ updated to 6.5.3
+ simplified pipeline to have only one folder
+ logstash-entrypoint script can now enable elastic password on all logstash output conf files.
+ New environment variables (ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT)
helk-nginx
+ split the default config for the two deployment options (helk-kibana-analysis (trial/base) and helk-kibana-notebook-analysis (trial/base)
helk-kibana
+ Updated to version 6.5.3
+ Added new environment variables (ELASTICSEARCH_URL, SERVER_HOST, SERVER_PORT, ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT, ELASTICSEARCH_USERNAME, ELASTICSEARCH_PASSWORD, KIBANA_UI_PASSWORD) and logic to make the build more dynamic
helk-jupyter
+ updated Jupyterlab to 0.35.4
+ updated jupyterhub to 0.9.4
+ updated jupyterlab hub extension to 0.12.0
+ updated ES_HADOOP to 6.5.3
+ updated org.apache.spark:spark-sql-kafka-0-10_2.11:2.4.0
+ Added extra notebooks to test deployment and provide more information for analyst experiencing Jupyter for the first time
helk-kafka-base
+ reduced docker container size
+ updated Kafka to 2.1.0 (this affects Kafka brokers and zookeeper)
helk-kafka-broker
+ User can now define a list of topics to be created via the new environment variable KAFKA_CREATE_TOPICS. That needs to be defined either in the docker-compose file or while running the docker container on its own.
helk-zookeeper
+ reduced size of container
+ updated build to kafka 2.1.0
helk-KSQL
+ initial integration of KSQL
+ KSQL Server and KSQL CLI are available
+ Blog post coming soon ;)
2018-12-13 21:27:17 +00:00
![version ](https://img.shields.io/badge/version-0.1.4-blue.svg?maxAge=2592000 )
[![License: GPL v3 ](https://img.shields.io/badge/License-GPLv3-blue.svg )](https://www.gnu.org/licenses/gpl-3.0)
[![GitHub issues-closed ](https://img.shields.io/github/issues-closed/Cyb3rward0g/HELK.svg )](https://GitHub.com/Cyb3rWard0g/HELK/issues?q=is%3Aissue+is%3Aclosed)
[![Twitter ](https://img.shields.io/twitter/follow/THE_HELK.svg?style=social&label=Follow )](https://twitter.com/THE_HELK)
[![Open Source Love svg1 ](https://badges.frapsoft.com/os/v1/open-source.svg?v=103 )](https://github.com/ellerbrock/open-source-badges/)
2018-01-06 22:14:43 +00:00
A Hunting ELK (Elasticsearch, Logstash, Kibana) with advanced analytic capabilities.
2018-01-16 01:11:13 +00:00
2018-01-16 01:07:44 +00:00
![alt text ](resources/images/HELK_Design.png "HELK Infrastructure" )
2017-04-14 05:29:04 +00:00
2017-06-29 15:21:59 +00:00
# Goals
2018-06-12 05:28:26 +00:00
2017-06-29 15:21:59 +00:00
* Provide a free hunting platform to the community and share the basics of Threat Hunting.
* Make sense of a large amount of event logs and add more context to suspicious events during hunting.
* Expedite the time it takes to deploy an ELK stack.
2018-01-06 22:14:43 +00:00
* Improve the testing of hunting use cases in an easier and more affordable way.
2018-01-08 23:26:44 +00:00
* Enable Data Science via Apache Spark, GraphFrames & Jupyter Notebooks.
2017-06-29 15:21:59 +00:00
2018-01-31 22:52:50 +00:00
# Current Status: Alpha
2018-06-12 05:28:26 +00:00
2018-01-31 22:52:50 +00:00
The project is currently in an alpha stage, which means that the code and the functionality are still changing. We haven't yet tested the system with large data sources and in many scenarios. We invite you to try it and welcome any feedback.
# HELK Features
2018-06-12 05:28:26 +00:00
2018-01-31 22:52:50 +00:00
* **Kafka:** A distributed publish-subscribe messaging system that is designed to be fast, scalable, fault-tolerant, and durable.
* **Elasticsearch:** A highly scalable open-source full-text search and analytics engine.
* **Logstash:** A data collection engine with real-time pipelining capabilities.
* **Kibana:** An open source analytics and visualization platform designed to work with Elasticsearch.
* **ES-Hadoop:** An open-source, stand-alone, self-contained, small library that allows Hadoop jobs (whether using Map/Reduce or libraries built upon it such as Hive, Pig or Cascading or new upcoming libraries like Apache Spark ) to interact with Elasticsearch.
* **Spark:** A fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs.
* **GraphFrames:** A package for Apache Spark which provides DataFrame-based Graphs.
* **Jupyter Notebook:** An open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text.
2018-12-14 15:29:12 +00:00
* **KSQL:** Confluent KSQL is the open source, streaming SQL engine that enables real-time data processing against Apache Kafka®. It provides an easy-to-use, yet powerful interactive SQL interface for stream processing on Kafka, without the need to write code in a programming language such as Java or Python
2018-12-13 21:33:05 +00:00
* **Elastalert:** ElastAlert is a simple framework for alerting on anomalies, spikes, or other patterns of interest from data in Elasticsearch
* **Sigma:** Sigma is a generic and open signature format that allows you to describe relevant log events in a straightforward manner.
2017-06-29 15:21:59 +00:00
2017-05-26 06:11:09 +00:00
# Getting Started
2018-06-12 05:28:26 +00:00
2018-02-15 08:28:48 +00:00
## WIKI
2018-06-12 05:28:26 +00:00
2018-02-15 08:28:48 +00:00
* [Introduction ](https://github.com/Cyb3rWard0g/HELK/wiki )
* [Architecture Overview ](https://github.com/Cyb3rWard0g/HELK/wiki/Architecture-Overview )
* [Kafka ](https://github.com/Cyb3rWard0g/HELK/wiki/Kafka )
* [Logstash ](https://github.com/Cyb3rWard0g/HELK/wiki/Logstash )
* [Elasticsearch ](https://github.com/Cyb3rWard0g/HELK/wiki/Elasticsearch )
* [Kibana ](https://github.com/Cyb3rWard0g/HELK/wiki/Kibana )
* [Spark ](https://github.com/Cyb3rWard0g/HELK/wiki/Spark )
* [Installation ](https://github.com/Cyb3rWard0g/HELK/wiki/Installation )
2018-01-08 23:20:50 +00:00
2018-02-25 07:59:44 +00:00
## (Docker) Accessing the HELK's Images
2018-06-12 05:28:26 +00:00
2018-05-03 19:54:12 +00:00
By default, the HELK's containers are run in the background (Detached). You can see all your docker containers by running the following command:
```
sudo docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
a97bd895a2b3 cyb3rward0g/helk-spark-worker:2.3.0 "./spark-worker-entr…" About an hour ago Up About an hour 0.0.0.0:8082->8082/tcp helk-spark-worker2
cbb31f688e0a cyb3rward0g/helk-spark-worker:2.3.0 "./spark-worker-entr…" About an hour ago Up About an hour 0.0.0.0:8081->8081/tcp helk-spark-worker
5d58068aa7e3 cyb3rward0g/helk-kafka-broker:1.1.0 "./kafka-entrypoint.…" About an hour ago Up About an hour 0.0.0.0:9092->9092/tcp helk-kafka-broker
bdb303b09878 cyb3rward0g/helk-kafka-broker:1.1.0 "./kafka-entrypoint.…" About an hour ago Up About an hour 0.0.0.0:9093->9093/tcp helk-kafka-broker2
7761d1e43d37 cyb3rward0g/helk-nginx:0.0.2 "./nginx-entrypoint.…" About an hour ago Up About an hour 0.0.0.0:80->80/tcp helk-nginx
ede2a2503030 cyb3rward0g/helk-jupyter:0.32.1 "./jupyter-entrypoin…" About an hour ago Up About an hour 0.0.0.0:4040->4040/tcp, 0.0.0.0:8880->8880/tcp helk-jupyter
ede19510e959 cyb3rward0g/helk-logstash:6.2.4 "/usr/local/bin/dock…" About an hour ago Up About an hour 5044/tcp, 9600/tcp helk-logstash
e92823b24b2d cyb3rward0g/helk-spark-master:2.3.0 "./spark-master-entr…" About an hour ago Up About an hour 0.0.0.0:7077->7077/tcp, 0.0.0.0:8080->8080/tcp helk-spark-master
6125921b310d cyb3rward0g/helk-kibana:6.2.4 "./kibana-entrypoint…" About an hour ago Up About an hour 5601/tcp helk-kibana
4321d609ae07 cyb3rward0g/helk-zookeeper:3.4.10 "./zookeeper-entrypo…" About an hour ago Up About an hour 2888/tcp, 0.0.0.0:2181->2181/tcp, 3888/tcp helk-zookeeper
9cbca145fb3e cyb3rward0g/helk-elasticsearch:6.2.4 "/usr/local/bin/dock…" About an hour ago Up About an hour 9200/tcp, 9300/tcp helk-elasticsearch
```
Then, you will just have to pick which container you want to access and run the following following commands:
2018-01-06 22:14:43 +00:00
```
2018-02-25 07:59:44 +00:00
sudo docker exec -ti < image-name > bash
2018-05-03 19:54:12 +00:00
root@ede2a2503030:/opt/helk/scripts#
2018-01-06 22:14:43 +00:00
```
2018-12-13 21:33:05 +00:00
# Resources
* [Welcome to HELK! : Enabling Advanced Analytics Capabilities ](https://cyberwardog.blogspot.com/2018/04/welcome-to-helk-enabling-advanced_9.html )
* [Spark ](https://spark.apache.org/docs/latest/index.html )
* [Spark Standalone Mode ](https://spark.apache.org/docs/latest/spark-standalone.html )
* [Setting up a Pentesting.. I mean, a Threat Hunting Lab - Part 5 ](https://cyberwardog.blogspot.com/2017/02/setting-up-pentesting-i-mean-threat_98.html )
* [An Integrated API for Mixing Graph and Relational Queries ](https://cs.stanford.edu/~matei/papers/2016/grades_graphframes.pdf )
* [Graph queries in Spark SQL ](https://www.slideshare.net/SparkSummit/graphframes-graph-queries-in-spark-sql )
* [Graphframes Overview ](http://graphframes.github.io/index.html )
* [Elastic Producs ](https://www.elastic.co/products )
* [Elastic Subscriptions ](https://www.elastic.co/subscriptions )
* [Elasticsearch Guide ](https://www.elastic.co/guide/en/elasticsearch/reference/current/index.html )
* [spujadas elk-docker ](https://github.com/spujadas/elk-docker )
* [deviantony docker-elk ](https://github.com/deviantony/docker-elk )
2017-08-12 04:50:56 +00:00
2017-06-29 15:21:59 +00:00
# Author
2018-06-12 05:28:26 +00:00
2018-01-08 22:58:42 +00:00
* Roberto Rodriguez [@Cyb3rWard0g ](https://twitter.com/Cyb3rWard0g ) [@THE_HELK ](https://twitter.com/THE_HELK )
2017-05-26 06:11:09 +00:00
2017-06-29 15:21:59 +00:00
# Contributors
2018-06-12 05:28:26 +00:00
* Jose Luis Rodriguez [@Cyb3rPandaH ](https://twitter.com/Cyb3rPandaH )
2017-09-07 23:21:12 +00:00
* Robby Winchester [@robwinchester3 ](https://twitter.com/robwinchester3 )
2018-02-15 08:28:48 +00:00
* Jared Atkinson [@jaredatkinson ](https://twitter.com/jaredcatkinson )
2018-01-08 22:58:42 +00:00
* Nate Guagenti [@neu5ron ](https://twitter.com/neu5ron )
2018-01-31 23:36:46 +00:00
* Lee Christensen [@tifkin_ ](https://twitter.com/tifkin_ )
2017-06-29 15:21:59 +00:00
# Contributing
2018-06-12 05:28:26 +00:00
2018-01-08 22:58:42 +00:00
There are a few things that I would like to accomplish with the HELK as shown in the To-Do list below. I would love to make the HELK a stable build for everyone in the community. If you are interested on making this build a more robust one and adding some cool features to it, PLEASE feel free to submit a pull request. #SharingIsCaring
2017-06-29 15:21:59 +00:00
2018-07-12 04:29:09 +00:00
# License: GPL-3.0
[ HELK's GNU General Public License ](https://github.com/Cyb3rWard0g/HELK/blob/master/LICENSE )
2017-06-29 15:21:59 +00:00
# TO-Do
2018-06-12 05:28:26 +00:00
2018-01-08 22:58:42 +00:00
- [X] Upload basic Kibana Dashboards
- [X] Integrate Spark & Graphframes
- [X] Add Jupyter Notebook on the top of Spark
2018-01-31 22:52:50 +00:00
- [X] Kafka Integration
2018-05-03 19:54:12 +00:00
- [X] Default X-Pack Basic - Free License Build for ELKStack
- [X] Spark Standalone Cluster Manager integration
- [X] Apache Arrow Integration for Pandas Dataframes
- [ ] Zepplin Notebook Docker option
v0.1.6-alpha12132018
HELK base image
+ Updated to 0.0.3
HELK ELK Version
+ Now using 6.5.3 official ELK Docker Images (https://www.elastic.co/blog/elastic-stack-6-5-3-released)
helk_install
+ Users can now select between two deployments:
++ helk-kibana-analysis (KAFKA + KSQL + ELK + NGNIX + ELASTALERT)
++ helk-kibana-notebooks (KAFKA + KSQL + ELK + NGNIX + ELASTALERT + SPARK + JUPYTER)
+ Fixed https://github.com/Cyb3rWard0g/HELK/issues/131 . Users can now set up the Kibana UI User password during installation. Also, user can set the Elasticsearch elastic account password when using the Trial license option.
helk-elastalert
+ Elastalert deployed and ready to use with SIGMA integration. Blog available at https://medium.com/@Cyb3rWard0g
helk-elasticsearch
+ consolidated main configs in one
+ added more environment variables for ELASTIC_PASSWORD and default values in case it is not used to be compatible with the default values applied to HELK.
helk-logstash
+ updated to 6.5.3
+ simplified pipeline to have only one folder
+ logstash-entrypoint script can now enable elastic password on all logstash output conf files.
+ New environment variables (ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT)
helk-nginx
+ split the default config for the two deployment options (helk-kibana-analysis (trial/base) and helk-kibana-notebook-analysis (trial/base)
helk-kibana
+ Updated to version 6.5.3
+ Added new environment variables (ELASTICSEARCH_URL, SERVER_HOST, SERVER_PORT, ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT, ELASTICSEARCH_USERNAME, ELASTICSEARCH_PASSWORD, KIBANA_UI_PASSWORD) and logic to make the build more dynamic
helk-jupyter
+ updated Jupyterlab to 0.35.4
+ updated jupyterhub to 0.9.4
+ updated jupyterlab hub extension to 0.12.0
+ updated ES_HADOOP to 6.5.3
+ updated org.apache.spark:spark-sql-kafka-0-10_2.11:2.4.0
+ Added extra notebooks to test deployment and provide more information for analyst experiencing Jupyter for the first time
helk-kafka-base
+ reduced docker container size
+ updated Kafka to 2.1.0 (this affects Kafka brokers and zookeeper)
helk-kafka-broker
+ User can now define a list of topics to be created via the new environment variable KAFKA_CREATE_TOPICS. That needs to be defined either in the docker-compose file or while running the docker container on its own.
helk-zookeeper
+ reduced size of container
+ updated build to kafka 2.1.0
helk-KSQL
+ initial integration of KSQL
+ KSQL Server and KSQL CLI are available
+ Blog post coming soon ;)
2018-12-13 21:27:17 +00:00
- [X] KSQL Client & Server Deployment (Waiting for v5.0)
2018-05-03 19:54:12 +00:00
- [ ] Kubernetes Cluster Migration
- [ ] OSQuery Data Ingestion
v0.1.6-alpha12132018
HELK base image
+ Updated to 0.0.3
HELK ELK Version
+ Now using 6.5.3 official ELK Docker Images (https://www.elastic.co/blog/elastic-stack-6-5-3-released)
helk_install
+ Users can now select between two deployments:
++ helk-kibana-analysis (KAFKA + KSQL + ELK + NGNIX + ELASTALERT)
++ helk-kibana-notebooks (KAFKA + KSQL + ELK + NGNIX + ELASTALERT + SPARK + JUPYTER)
+ Fixed https://github.com/Cyb3rWard0g/HELK/issues/131 . Users can now set up the Kibana UI User password during installation. Also, user can set the Elasticsearch elastic account password when using the Trial license option.
helk-elastalert
+ Elastalert deployed and ready to use with SIGMA integration. Blog available at https://medium.com/@Cyb3rWard0g
helk-elasticsearch
+ consolidated main configs in one
+ added more environment variables for ELASTIC_PASSWORD and default values in case it is not used to be compatible with the default values applied to HELK.
helk-logstash
+ updated to 6.5.3
+ simplified pipeline to have only one folder
+ logstash-entrypoint script can now enable elastic password on all logstash output conf files.
+ New environment variables (ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT)
helk-nginx
+ split the default config for the two deployment options (helk-kibana-analysis (trial/base) and helk-kibana-notebook-analysis (trial/base)
helk-kibana
+ Updated to version 6.5.3
+ Added new environment variables (ELASTICSEARCH_URL, SERVER_HOST, SERVER_PORT, ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT, ELASTICSEARCH_USERNAME, ELASTICSEARCH_PASSWORD, KIBANA_UI_PASSWORD) and logic to make the build more dynamic
helk-jupyter
+ updated Jupyterlab to 0.35.4
+ updated jupyterhub to 0.9.4
+ updated jupyterlab hub extension to 0.12.0
+ updated ES_HADOOP to 6.5.3
+ updated org.apache.spark:spark-sql-kafka-0-10_2.11:2.4.0
+ Added extra notebooks to test deployment and provide more information for analyst experiencing Jupyter for the first time
helk-kafka-base
+ reduced docker container size
+ updated Kafka to 2.1.0 (this affects Kafka brokers and zookeeper)
helk-kafka-broker
+ User can now define a list of topics to be created via the new environment variable KAFKA_CREATE_TOPICS. That needs to be defined either in the docker-compose file or while running the docker container on its own.
helk-zookeeper
+ reduced size of container
+ updated build to kafka 2.1.0
helk-KSQL
+ initial integration of KSQL
+ KSQL Server and KSQL CLI are available
+ Blog post coming soon ;)
2018-12-13 21:27:17 +00:00
- [X] Create Jupyter Notebooks showing how to use Spark & GraphFrames
2018-01-08 23:20:50 +00:00
- [ ] MITRE ATT& CK mapping to logs or dashboards
2018-05-03 19:54:12 +00:00
- [ ] Cypher for Apache Spark Integration (Might have to switch from Jupyter to Zeppelin Notebook)
2018-01-08 22:58:42 +00:00
- [ ] Somehow integrate neo4j spark connectors with build
v0.1.6-alpha12132018
HELK base image
+ Updated to 0.0.3
HELK ELK Version
+ Now using 6.5.3 official ELK Docker Images (https://www.elastic.co/blog/elastic-stack-6-5-3-released)
helk_install
+ Users can now select between two deployments:
++ helk-kibana-analysis (KAFKA + KSQL + ELK + NGNIX + ELASTALERT)
++ helk-kibana-notebooks (KAFKA + KSQL + ELK + NGNIX + ELASTALERT + SPARK + JUPYTER)
+ Fixed https://github.com/Cyb3rWard0g/HELK/issues/131 . Users can now set up the Kibana UI User password during installation. Also, user can set the Elasticsearch elastic account password when using the Trial license option.
helk-elastalert
+ Elastalert deployed and ready to use with SIGMA integration. Blog available at https://medium.com/@Cyb3rWard0g
helk-elasticsearch
+ consolidated main configs in one
+ added more environment variables for ELASTIC_PASSWORD and default values in case it is not used to be compatible with the default values applied to HELK.
helk-logstash
+ updated to 6.5.3
+ simplified pipeline to have only one folder
+ logstash-entrypoint script can now enable elastic password on all logstash output conf files.
+ New environment variables (ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT)
helk-nginx
+ split the default config for the two deployment options (helk-kibana-analysis (trial/base) and helk-kibana-notebook-analysis (trial/base)
helk-kibana
+ Updated to version 6.5.3
+ Added new environment variables (ELASTICSEARCH_URL, SERVER_HOST, SERVER_PORT, ELASTIC_PASSWORD, ELASTIC_HOST, ELASTIC_PORT, ELASTICSEARCH_USERNAME, ELASTICSEARCH_PASSWORD, KIBANA_UI_PASSWORD) and logic to make the build more dynamic
helk-jupyter
+ updated Jupyterlab to 0.35.4
+ updated jupyterhub to 0.9.4
+ updated jupyterlab hub extension to 0.12.0
+ updated ES_HADOOP to 6.5.3
+ updated org.apache.spark:spark-sql-kafka-0-10_2.11:2.4.0
+ Added extra notebooks to test deployment and provide more information for analyst experiencing Jupyter for the first time
helk-kafka-base
+ reduced docker container size
+ updated Kafka to 2.1.0 (this affects Kafka brokers and zookeeper)
helk-kafka-broker
+ User can now define a list of topics to be created via the new environment variable KAFKA_CREATE_TOPICS. That needs to be defined either in the docker-compose file or while running the docker container on its own.
helk-zookeeper
+ reduced size of container
+ updated build to kafka 2.1.0
helk-KSQL
+ initial integration of KSQL
+ KSQL Server and KSQL CLI are available
+ Blog post coming soon ;)
2018-12-13 21:27:17 +00:00
- [X] Nxlog parsers (Logstash Filters)
2018-01-08 22:58:42 +00:00
- [ ] Add more network data sources (i.e Bro)
2018-03-04 04:44:09 +00:00
- [ ] Research & integrate spark structured direct streaming
2018-12-14 15:29:12 +00:00
- [ ] Packer Images
2017-05-26 06:31:12 +00:00
More coming soon...