22 KiB
Templating Guide
Nuclei is based on the concepts of YAML
based template files that define how the requests will be sent and processed. This allows easy extensibility capabilities to nuclei.
The templates are written in YAML
which specifies a simple human readable format to quickly define the execution process.
Let's start with the basics and define our own workflow file for detecting the presence of a .git/config
file on a webserver and take it from there.
Table of Contents
Template Details
Each template has a unique ID which is used during output writing to specify the template name for an output line.
The template file ends with .yaml extension. The template files can be created any text editor of your choice.
# id contains the unique identifier for the template.
id: git-config
ID must not contain spaces. This is done to allow easier output parsing.
Info
Next important piece of information about a template is the info block. Info block provides more context on the purpose of the template and the author. It can also contain a severity field which indicates the severity of the template.
Let's add an info block to our template as well.
info:
# Name is the name of the template
name: Git Config File Detection Template
# Author is the name of the author for the template
author: Ice3man
# Severity is the severity for the template.
severity: medium
Actual requests and corresponding matchers are placed below the info block and they perform the task of making requests to target servers and finding if the template request was successful.
Each template file can contain multiple requests to be made. The template is iterated and one by one the desired HTTP/DNS requests are made to the target sites.
HTTP Requests
Requests start with a request block which specifies the start of the requests for the template.
# Start the requests for the template right here
requests:
At this point you can define raw requests like the following ones (as of now it's suggested to leave the Host
header as in the example with the variable {{Hostname}}
).
requests:
- raw:
- |
GET /path1/ HTTP/1.1
User-Agent: chrome
Host: {{Hostname}}
Accept: a1/b1
- |
POST /path2/ HTTP/1.1
User-Agent: chrome
Host: {{Hostname}}
Accept: a2/b2
This is the request Body
Otherwise you can define structured requests as described in the following paragraphs. Requests can be fine tuned to perform the exact tasks as desired. Nuclei requests are fully configurable meaning you can configure and define each and every single thing about the requests that will be sent to the target servers. A recent addition in raw requests has been the introduction of intruder-like functionalities. It's possible to define placeholders as {{placeholder}}
, and perform Sniper, Pitchfork and ClusterBomb attacks. The wordlist for these attacks needs to be defined during the request definition under the Payload
field. Finally all DSL functionalities are fully available and supported, and can be used to manipulate the final values. Here follows an example:
id: dummy-raw
info:
name: Example-Fuzzing
requests:
- payloads:
param_a: /home/user/wordlist_param_a.txt
param_b: /home/user/wordlist_param_b.txt
attack: clusterbomb # Available options: sniper, pitchfork and clusterbomb
raw:
# Request with simple param and header manipulation with DSL functions
- |
POST /?param_a={{param_a}}¶mb={{param_b}} HTTP/1.1
User-Agent: {{param_a}}
Host: {{Hostname}}
another_header: {{base64(param_b)}}
Accept: */*
This is the Body
# Request with body manipulation
- |
DELETE / HTTP/1.1
User-Agent: nuclei
Host: {{Hostname}}
This is the body {{sha256(param_a)}}
# Yet another one
- |
PUT / HTTP/1.1
Host: {{Hostname}}
This is again the request body {{html_escape(param_a)}} + {{hex_encode(param_b))}}
matchers:
- type: word
words:
- "title"
- "body"
This functionality is not optimized for speed as the internal http library needs to be set explicitly with connection reuse policy.
Method
First thing in the request is method. Request method can be GET, POST, PUT, DELETE, etc depending on the needs.
# Method is the method for the request
method: GET
Redirects
Redirection conditions can be specified per each template. By default, redirects are not followed. However, if desired, they can be enabled with redirects: true
in request details. 10 redirects are followed at maximum by default which should be good enough for most use cases. More fine grained control can be exercised over number of redirects followed by using max-redirects
field.
An example of the usage:
requests:
- method: GET
path:
- "{{BaseURL}}/login.php
redirects: true
max-redirects: 3
Path
The next part of the requests is the path of the request path. Dynamic variables can be placed in the path to modify its behavior on runtime. Variables start with {{
and end with }}
and are case-sensitive.
- BaseURL - Placing BaseURL as a variable in the path will lead to it being replaced on runtime in the request by the original URL as specified in the target file.
- Hostname - Hostname variable is replaced by the hostname of the target on runtime.
Some sample dynamic variable replacement examples:
path: {{BaseURL}}/.git/config
# This path will be replaced on execution with BaseURL
# If BaseURL is set to https://abc.com then the
# path will get replaced to the following: https://abc.com/.git/config
Multiple paths can also be specified in one request which will be requested for the target.
Headers
Headers can also be specified to be sent along with the requests. Headers are placed in form of key/value pairs. An example header configuration looks like this:
# headers contains the headers for the request
headers:
# Custom user-agent header
User-Agent: Some-Random-User-Agent
# Custom request origin
Origin: https://google.com
Body
Body specifies a body to be sent along with the request. For instance:
# Body is a string sent along with the request
body: "{\"some random JSON\"}"
Matchers
Matchers are the core of nuclei. They are what make the tool so powerful. Multiple type of combinations and checks can be added to ensure that the results you get are free from false-positives.
Types
Multiple matchers can be specified in a request. There are basically 6 types of matchers:
Matcher Type | Part Matched |
---|---|
status | Status Code of Response |
size | Content Length of Response |
word | Response body or headers |
regex | Response body or headers |
binary | Response body |
dsl | All Response Parts |
To match status codes for responses, you can use the following syntax.
matcher:
# Match the status codes
- type: status
# Some status codes we want to match
status:
- 200
- 302
To match binary for hexadecimal responses, you can use the following syntax.
matchers:
- type: binary
binary:
- "504B0304" # zip archive
- "526172211A070100" # rar RAR archive version 5.0
- "FD377A585A0000" # xz tar.xz archive
condition: or
part: body
To match size, similar structure can be followed. If the status code of response from the site matches any single one specified in the matcher, the request is marked as successful.
Word and Regex matchers can be further configured depending on the needs of the users.
Complex matchers of type dsl allows to build more elaborated expressions with helper functions, this is an example of a complex DSL matcher:
matchers:
- type: dsl
dsl:
- "len(body)<1024 && status_code==200" # Body length less than 1024 and 200 status code
- "contains(toupper(body), md5(cookie))" # Check if the MD5 sum of cookies is contained in the uppercase body
Every part of a http response can be matched:
Response Part | Description | Example |
---|---|---|
content_length | Header Content-Length | content_length >= 1024 |
status_code | Response Status Code | status_code==200 |
all_headers | Unique string containing all headers | len(all_headers) |
body | Body as string | len(body) |
header_name | Lowercase header name with "-" converted to "_" | len(user_agent) |
raw | Headers + Response | len(raw) |
This is the list for a dns response:
Response Part | Description | Example |
---|---|---|
rcode | Response status | rcode == "NXDOMAIN |
question | Response Question Section | len(question) |
extra | Response Extra Section | len(extra) |
answer | Response Answers Section | len(answer) |
ns | Response Authority Section | len(ns) |
raw | Full Response | len(raw) |
The helper functions are:
Helper Function | Description | Example |
---|---|---|
len | Length of a string | len("Hello") // Result: 5 |
toupper | string to uppercase | toupper("Hello") // Result: "HELLO" |
tolower | string to lowercase | tolower("Hello") // Result: "hello" |
replace | Replace string parts | replace("Hello", "He", "Ha") // Result: "Hallo" |
trim | Remove trailing unicode chars | trim("aaaHelloddd", "ad") // Result: "Hello" |
trimleft | Remove unicode chars from left | trimleft("aaaHelloddd", "ad") // Result: "Helloddd" |
trimright | Remove unicode chars from right | trimleft("aaaHelloddd", "ad") // Result: "aaaHello" |
trimspace | Remove trailing spaces | trimspace(" Hello ") // Result: "Hello" |
trimprefix | Trim specified prefix | trimprefix("aaHelloaa", "aa") // Result: "Helloaa" |
trimsuffix | Trim specified suffix | trimsuffix("aaHelloaa", "aa") // Result: "aaHello" |
base64 | Encode string to base64 | base64("Hello") // Result: "SGVsbG8=" |
base64_decode | Decode string from base64 | base64_decode("SGVsbG8=") // Result: "Hello" |
url_encode | URL encode a string | url_encode("https://projectdiscovery.io/test?a=1") // Result: "https:%2F%2Fprojectdiscovery.io%3Ftest=1" |
url_decode | URL decode a string | url_decode("https:%2F%2Fprojectdiscovery.io%3Ftest=1") // Result: "https://projectdiscovery.io/test?a=1" |
hex_encode | Hex encode a string | hex_encode("aa") // Result: "6161" |
hex_decode | Hex decode a string | hex_decode("6161") // Result: "aa" |
html_escape | Hex encode a string | html_escape("<html>test</html>") // Result: "<html><body>test</body></html>" |
html_unescape | Hex decode a string | html_unescape("<html><body>test</body></html>") // Result: "<html>test</html>" |
md5 | Calculate md5 of string | md5("Hello") // Result: "8b1a9953c4611296a827abf8c47804d7" |
sha256 | Calculate sha256 of string | sha256("Hello") // Result: "185f8db32271fe25f561a6fc938b2e264306ec304eda518007d1764826381969" |
sha1 | Calculate sha1 of string | sha1("Hello") // Result: "f7ff9e8b7bb2e09b70935a5d785e0cc5d9d0abf0" |
contains | Verify if a string contains another one | contains("Hello", "lo") // Result: True |
regex | Verify a regex versus a string | regex("H([a-z]+)o", "Hello") // Result: True |
Conditions
Multiple words and regexes can be specified in a single matcher and can be configured with different conditions like AND and OR.
- AND - Using AND conditions allows matching of all the words from the list of words for the matcher. Only then will the request be marked as successful when all the words have been matched.
- OR - Using OR conditions allows matching of a single word from the list of matcher. The request will be marked as successful when even one of the word is matched for the matcher.
Matched Parts
Multiple parts of the response can also be matched for the request.
Part | Matched Part |
---|---|
body | Body of the response |
header | Header of the response |
all | Both body and header of the response |
Example matchers for response body using the AND condition:
matcher:
# Match the body word
- type: word
# Some words we want to match
words:
- "[core]"
- "[config]"
# Both words must be found in the response body
condition: and
# We want to match request body (default)
part: body
Similarly, matchers can be written to match anything that you want to find in the response body allowing unlimited creativity and extensibility.
Multiple Matchers
Multiple matchers can be used in a single template to fingerprint multiple conditions with a single request.
Here is an example of syntax for multiple matchers.
matchers:
- type: word
name: php
words:
- "X-Powered-By: PHP"
- "PHPSESSID"
part: header
- type: word
name: node
words:
- "Server: NodeJS"
- "X-Powered-By: nodejs"
condition: or
part: header
- type: word
name: python
words:
- "Python/2."
- "Python/3."
condition: or
part: header
Matchers Condition
While using multiple matchers the default condition is to follow OR operation in between all the matchers, AND operation can be used to make sure return the result if all matchers returns true.
matchers-condition: and
matchers:
- type: word
words:
- "X-Powered-By: PHP"
- "PHPSESSID"
condition: or
part: header
- type: word
words:
- "PHP"
part: body
Extractors
Extractors are another important feature of nuclei. Extractors can be used to extract and display in results a match from the response body or headers based on a regular expression.
Currently only regex
type extractors are supported. A sample extractor for extracting API keys from the response body is as follows:
# A list of extractors for text extraction
extractors:
# type of the extractor, only regex for now.
- type: regex
# part of the response to extract (can be headers, all too)
part: body
# regex to use for extraction.
regex:
- "(A3T[A-Z0-9]|AKIA|AGPA|AROA|AIPA|ANPA|ANVA|ASIA)[A-Z0-9]{16}"
Example HTTP Template
The final template file for the .git/config
file mentioned above is as follows:
id: git-config
info:
name: Git Config File
author: Ice3man
severity: medium
requests:
- method: GET
path:
- "{{BaseURL}}/.git/config"
matchers:
- type: word
words:
- "[core]"
DNS Requests
Requests start with a dns block which specifies the start of the requests for the template.
# Start the requests for the template right here
dns:
DNS requests can be fine tuned to perform the exact tasks as desired. Nuclei requests are fully configurable meaning you can configure and define each and every single thing about the requests that will be sent to the target servers.
Type
First thing in the request is type. Request type can be A, NS, CNAME, SOA, PTR, MX, TXT, AAAA.
# type is the type for the dns request
type: A
Name
The next part of the requests is the name of the request path. Dynamic variables can be placed in the path to modify its value on runtime. Variables start with {{
and end with }}
and are case-sensitive.
- FQDN - variable is replaced by the hostname/FQDN of the target on runtime.
Some sample dynamic variable replacement examples:
name: {{FQDN}}.com
# This value will be replaced on execution with the FQDN.
# If FQDN is https://this.is.an.example then the
# name will get replaced to the following: this.is.an.example.com
As of now the tool supports only one question per request.
Class
Class type can be INET, CSNET, CHAOS, HESIOD, NONE and ANY. Usually it's enough to just leave it as INET.
# method is the class for the dns request
class: inet
Recursion
Recursion is a boolean value, and determines if the resolver should only return cached results, or traverse the whole dns root tree to retrieve fresh results. Generally it's better to leave it as true.
# Recursion is a boolean determining if the request is recursive
recursion: true
Retries
Retries is the number of attempts a dns query is retried before giving up among different resolvers. It's recommended a reasonable value, like 3.
# Retries is a number of retries before giving up on dns resolution
retries: 3
Matchers
Matchers are just equal to HTTP, but the search is performed on the whole dns response, therefore it's not necessary to specify the part. Multiple type of combinations and checks can be added to ensure that the results you get are free from false positives. The complex dsl matcher type allows to build complex queries as described in the HTTP section.
Types
Multiple matchers can be specified in a request. There are basically 5 types of matchers:
Matcher Type | Part Matched |
---|---|
size | Response size |
word | Response |
regex | Response |
binary | Response |
dsl | Response |
Example DNS Template
The final example template file for performing A
query, and check if CNAME and A records are in the response is as follows:
id: dummy-cname-a
info:
name: Dummy A dns request
author: mzack9999
severity: none
dns:
- name: "{{FQDN}}"
type: A
class: inet
recursion: true
retries: 3
matchers:
- type: word
words:
# The response must contains a CNAME record
- "IN\tCNAME"
# and also at least 1 A record
- "IN\tA"
condition: and