Information Security Analytics Blog: Cyber Attack Graph Schema (CAGS) 1.0

Monday, July 29, 2013

Cyber Attack Graph Schema (CAGS) 1.0

While the concept of attack graphs has been discussed, one thing that is lacking is a standard definition for an attack graph. This post hopes to resolve that by presenting a new standard: the Cyber Attack Graph Schema (CAGS) 1.0.

1. All property names must be lower case

2. Nodes must have the following properties:

    1. "class": may be "actor", "event", "condition", or "attribute"

    2. "cpt": must be a JSON string in the format defined at http://infosecanalytics.blogspot.com/2013/03/conditional-probability-tables-in-json.html

    3. "start": The time the node is created. Time should be in ISO 8601 combined date and time format (e.g. 2013-03-14T16:57Z)

    4. "id": Assigned by database.

3. Nodes must have property "label".

4. The "label" property of nodes of "class" "event", "condition", or "actor" will contain a string holding a narrative describing the actor, event, or condition

5. The "label" property of nodes of "class" "attribute" must contain a JSON formatted string with a single {"type": "value"} pair, where "type" is the type or name of the attribute and "value" is its value.

6. Nodes of any class MAY have property "comments" providing additional narrative on the node

7. Nodes of any class MAY have property "finish" providing a finish time for the node. Time should be in ISO 8601 combined date and time format (e.g. 2013-03-14T16:57Z)

8. Edges must have the following properties:

    1. "source": the id of the source node

    2. "target": the id of the target node

    3. "id": id assigned by the database

    4. "relationship":

        1. Value of "leads to" if "source" property "class" is "event", "threat", or "condition" and "target" property "class" is "actor", "event", or "condition"

        2. Value of "influence" if "source" property "class" is "attribute" and "target" property "class" is "event" or "condition"

        3. Value of "described by" if "source" property "class" is "event", "condition", or "actor" and "target" property "class" is "attribute"

        4. Value of "described by" if both "source" and "target" property "class" are "attribute"

    5. "directed": value of "True"

9. Edges may have a property "confidence" with an integer value from 0 to 100 representing the percent confidence

10. Edges must be directed

11. Nodes and edges may have additional properties; however, they will not be validated and may be ignored by the attack graph.

12. Nodes and edges with missing values may still be accepted if the values can be filled in.
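
To make the rules above concrete, here is a minimal Python sketch of a validator for CAGS 1.0 nodes and edges. It is an illustration under assumptions, not a reference implementation: the function names, the dict representation, and the sample data are all hypothetical. The "cpt" value is only checked for presence (the "{}" used below is a placeholder, not a real table), labels are parsed with json.loads and so need double-quoted JSON, and of the relationship rules only the "described by" pairing is checked.

```python
import json
from datetime import datetime

NODE_CLASSES = {"actor", "event", "condition", "attribute"}
NODE_REQUIRED = {"class", "cpt", "start", "id", "label"}
EDGE_REQUIRED = {"source", "target", "id", "relationship", "directed"}
RELATIONSHIPS = {"leads to", "influence", "described by"}


def is_iso8601(value):
    """Return True if value parses as an ISO 8601 combined date and time."""
    try:
        # Older Pythons reject a trailing "Z", so map it to a UTC offset.
        datetime.fromisoformat(value.replace("Z", "+00:00"))
        return True
    except (AttributeError, ValueError):
        return False


def check_names(props, required):
    """Report upper-case property names and missing required properties."""
    problems = [f"property name not lower case: {k}" for k in props if k != k.lower()]
    problems += [f"missing property: {k}" for k in sorted(required - set(props))]
    return problems


def validate_node(node):
    """Return a list of problems found in a node dict (empty if none)."""
    problems = check_names(node, NODE_REQUIRED)
    if node.get("class") not in NODE_CLASSES:
        problems.append("class must be actor, event, condition, or attribute")
    for key in ("start", "finish"):  # "finish" is optional
        if key in node and not is_iso8601(node[key]):
            problems.append(f"{key} is not an ISO 8601 date and time")
    if node.get("class") == "attribute" and "label" in node:
        try:
            pair = json.loads(node["label"])
        except (TypeError, ValueError):
            pair = None
        if not isinstance(pair, dict) or len(pair) != 1:
            problems.append("attribute label must hold a single type/value pair")
    return problems


def validate_edge(edge, nodes_by_id):
    """Return a list of problems found in an edge dict (empty if none)."""
    problems = check_names(edge, EDGE_REQUIRED)
    rel = edge.get("relationship")
    if rel not in RELATIONSHIPS:
        problems.append("unknown relationship")
    # Assumption: accept either the string "True" or a boolean True.
    if edge.get("directed") not in (True, "True"):
        problems.append('directed must be "True"')
    conf = edge.get("confidence")  # optional
    if conf is not None and not (isinstance(conf, int) and 0 <= conf <= 100):
        problems.append("confidence must be an integer from 0 to 100")
    target_class = nodes_by_id.get(edge.get("target"), {}).get("class")
    if rel == "described by" and target_class != "attribute":
        problems.append('"described by" must target an attribute node')
    return problems


# Hypothetical sample data: an actor described by an IP address attribute.
nodes = [
    {"id": 1, "class": "actor", "label": "External attacker",
     "cpt": "{}", "start": "2013-03-14T16:57Z"},
    {"id": 2, "class": "attribute", "label": '{"ip": "8.8.8.8"}',
     "cpt": "{}", "start": "2013-03-14T16:57Z"},
]
by_id = {n["id"]: n for n in nodes}
edge = {"id": 10, "source": 1, "target": 2, "relationship": "described by",
        "directed": "True", "confidence": 90}

print([validate_node(n) for n in nodes])  # → [[], []]
print(validate_edge(edge, by_id))         # → []
```

Returning a list of problems rather than raising on the first one fits rule 12: a caller can inspect what is missing and decide whether the value can be filled in.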

13 comments:

Gabe, September 4, 2013 at 1:11 PM
Consider replacing spaces with underscores (e.g. "described by" becomes "described_by").

Consider replacing "start" with "start_time" as start is ambiguous in some cypher queries.

Consider describing attributes as {"class":"attribute", "attribute":, :} rather than just {"class":"attribute", :} to improve ease of querying the graph.

Gabe, November 5, 2013 at 8:48 AM
Consider requiring edges to have a start_time.

Gabe, May 25, 2014 at 5:27 AM
Consider describing attributes as {"class":"attribute", "attribute type":, "type value":}. This would improve querying the graph directly for a value and for the type.

Gabe, August 16, 2014 at 4:29 PM
'label' is reserved in some graph databases. Consider using the class value in place of label and indexing all class values on all nodes.

Gabe, August 16, 2014 at 4:30 PM
The cpt requirement will be removed in next version.

Gabe, August 20, 2014 at 2:58 PM
Graph IDs should be a URI of the form :?class=< node class>&=&= so class:attribute, attribute = ip, ip = 8.8.8.8 at mybiz would be mybiz:?class=attribute&attribute=ip&ip=8.8.8.8
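
A minimal Python sketch of the mybiz example, assuming the URI is a prefix followed by ":?" with the class, key, and value chained as query parameters. The function name node_uri and its parameters are hypothetical, and percent-encoding through urlencode is an assumption rather than part of the schema.

```python
from urllib.parse import urlencode


def node_uri(prefix, node_class, key, value):
    """Build a node URI by chaining class -> key -> value as query parameters."""
    # A list of pairs keeps the parameters in the order shown in the example.
    query = urlencode([("class", node_class), (node_class, key), (key, value)])
    return f"{prefix}:?{query}"


print(node_uri("mybiz", "attribute", "ip", "8.8.8.8"))
# → mybiz:?class=attribute&attribute=ip&ip=8.8.8.8
```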

Gabe, August 24, 2014 at 4:56 PM
To allow efficient storage, it may be necessary to express {class:, :, :,} with explicit columns of {class:, key:, value:}. The advantage is that nodes can be indexed on class, key, and value. The limitation is that the a:b, b:c, c:d, d:etc, chain is limited in length.

Gabe, August 24, 2014 at 6:29 PM
Consider making edge URIs derived from their source, relationship, destination triple.

In documentation, may want to correlate source, relationship, destination to subject, predicate, object.

Gabe, August 29, 2014 at 9:41 AM
Consider allowing edges to have sub-relationships such as: .

Consider allowing edges to have an origin to explain the enrichment they came from. e.g. .

Gabe, September 18, 2014 at 12:21 PM
The URI should be stored as an attribute to the node or edge with a key of 'uri' and should be used as the node and edge id whenever possible.

Gabe, September 30, 2014 at 5:26 AM
Need to consider how to handle the difference between "no relationship found" and "creation of relationship not attempted".

Gabe, October 31, 2014 at 11:17 AM
Prefixes should not be required on URIs within a graph. The reasoning being that if the nodes/edges are within a graph, the prefix is implicit.

The case exists where we may wish to suggest that knowledge about a node resides in another graph. While adding the prefix to the node would indicate that, it also allows for two nodes of the same key:value to exist in the same graph. Moreover, a key:value node such as could be used to suggest an algorithm should query another graph for the information.

This does not preclude having a prefix on a node in a graph (with the absence of a prefix implying that the location of the graph represents the prefix); however, such a prefix would require a means of translating a prefix to a fully qualified location, which does not currently exist in the schema.

This does not preclude including a prefix (or the fully qualified URI) in a client subgraph to help distinguish between nodes from different locations. However, it will still suffer from the same issue of potential duplicate nodes. It is more advisable that prefixes only be kept for edges. The client may choose how to keep the mapping between prefix and full location.