Rule
|
|
---|---|
Any keyword |
A file must contain at least one keyword
in the keyword list.
|
All keywords
|
A file must contain all the keywords in the keyword list. |
All keywords within <x> characters
|
A file must contain all the keywords in the keyword list. In addition, the number
of characters
from the beginning of the first keyword to the beginning of the last keyword must
be within <x>
characters.
For example, your 3 keywords
are WEB, DISK, and USB and the number of characters you specified
is 20.
If IMSVA detects all keywords in the order DISK, WEB, and
USB, the number of characters from the "D" (in DISK) to the "U" (in USB) must be 20
characters or
less.
When deciding on the
number of characters, remember that a small number, such as 10,
will usually result in faster scanning time but will only cover
a relatively small area. This may reduce the likelihood of detecting
sensitive data, especially in large files. As the number increases,
the area covered also increases but scanning time might be slower.
|
Combined score for keywords exceeds threshold
|
A file must contain one or more keywords
in the keyword list. If only one keyword was detected, its score
must be higher than the threshold. If there are several keywords,
their combined score must be higher than the threshold.
Assign
each keyword a score of 1 to 10. A highly confidential word or phrase,
such as "salary increase" for the Human Resources department, should
have a relatively high score. Words or phrases that, by themselves,
do not carry much weight can have lower scores.
Consider the scores assigned to keywords when configuring the threshold. For example,
if you have
five keywords and three of those keywords are high priority, the threshold can be
equal to or lower
than the combined score of the three high priority keywords. This means that the detection
of these
three keywords is enough to treat the file as sensitive.
|