benford's law
Benford’s Law Application
I recently
obtained and reviewed accounting data from a client to perform a
Benford’s analysis. I obtained billing history data and general
ledger data covering multiple periods and accounts.
Benford’s is a very quick and easy technique to assist an
auditor in determining where there may be areas of greater risk.
While not the end all be all by any means, it can quickly
provide additional insight into areas that may warrant
additional investigation.
Used with other tools, it can corroborate other findings
or, perhaps, provide a risk indicator not provided by other
means.
Once a Benford’s
analysis is complete, the results could be used as the nexus to
request client internal audit reports.
We’re not fishing, we’re confirming analytical findings.
What follows is
client data from their billing system.
The universe consisted of 439,741 records covering 167
separate accounts.
When Benford’s is applied to this total universe we get the
following results.
Total Records
One
Two
Three
Four
Five
Six
Seven
Eight
Nine
414,355
148,981
50,480
37,000
42,768
53,050
21,510
18,763
22,299
19,504
Calculated Benford's
35.95%
12.18%
8.93%
10.32%
12.80%
5.19%
4.53%
5.38%
4.71%
Standard Benford's
30.10%
17.61%
12.49%
9.69%
7.92%
6.70%
5.80%
5.12%
4.58%
Diff From Standard
5.85%
-5.43%
-3.56%
0.63%
4.89%
-1.50%
-1.27%
0.27%
0.13%
Pct Diff From Std
19.44%
-30.81%
-28.53%
6.51%
61.70%
-22.46%
-21.91%
5.21%
2.86%
In the previous example, it appears that Benford’s did not work
as promised. Not so.
As previously
noted, we have 167 different accounts in this universe.
The calculation was made across all these non-homogenous
accounts.
Homogeneity is necessary for Benford’s to work.
Many of these accounts may have attributes that prevents
Benford’s from working properly.
For example, there might be accounts containing recurring
purchases of the same amount and or restricted by a minimum or
maximum amount, like an hourly wage rate.
This would defeat the Benford’s requirement that the
amounts be “naturally occurring”.
In the same vein, serial numbers, invoice numbers or any
other instance where numbers are assigned in a document will not
respond as anticipated in accordance with Benford’s Law.
Further, Benford’s doesn’t work well in very small
samples. You should
have at least 500 transactions in the data set you’re testing.
You may recall
that there were a total of 439,741 records.
If so, you may have noticed that I only have data for
414,355. The missing
25,386 records were those that had a value of zero or less than
one. These records
were removed from the sample.
Client Account - Post
Differential No Fringe – POOR Benford’s Match
Total Records | One | Two | Three | Four | Five | Six | Seven | Eight | Nine | ||
8082 | 642 | 282 | 202 | 157 | 615 | 1342 | 640 | 743 | 3459 | ||
Calculated Benford's | 7.94% | 3.49% | 2.50% | 1.94% | 7.61% | 16.60% | 7.92% | 9.19% | 42.80% | ||
Standard Benford's | 30.10% | 17.61% | 12.49% | 9.69% | 7.92% | 6.70% | 5.80% | 5.12% | 4.58% | ||
Diff From Std | -22.16% | -14.12% | -9.99% | -7.75% | -0.31% | 9.91% | 2.12% | 4.08% | 38.22% | ||
Pct Diff From Std | -73.61% | -80.18% | -80.00% | -79.95% | -3.90% | 148.02% | 36.56% | 79.73% | 835.29% |
The preceding calculations were made from more than enough
records to provide Benford’s with an opportunity to work, but,
as you can see, there are substantial differences between the
Calculated Benford’s and what was anticipated in Standard
Benford’s. As this
is Post Differential, the sample does not conform to Benford’s
“naturally occurring” requirement.
Post is based upon a standard percentage of salary which
does not permit the numbers to be random. For an account like
this, Benford’s is of little value.
Client Account - Expensed
Equipment – GOOD Benford’s Match
Total Records | One | Two | Three | Four | Five | Six | Seven | Eight | Nine | ||
2130 | 642 | 422 | 253 | 181 | 177 | 138 | 103 | 106 | 108 | ||
Calculated Benfords | 30.14% | 19.81% | 11.88% | 8.50% | 8.31% | 6.48% | 4.84% | 4.98% | 5.07% | ||
Standard Benfords | 30.10% | 17.61% | 12.49% | 9.69% | 7.92% | 6.70% | 5.80% | 5.12% | 4.58% | ||
Diff From Std | 0.04% | 2.20% | -0.62% | -1.19% | 0.39% | -0.22% | -0.96% | -0.14% | 0.49% | ||
Pct Diff From Std | 0.13% | 12.51% | -4.93% | -12.31% | 4.95% | -3.23% | -16.61% | -2.71% | 10.80% |
Conversely, the preceding Expensed Equipment account’s results
closely track the Benford’s standard.
The auditor would not rely solely on these results to
conclude that risk was minimal; nevertheless, this quick
analysis would be used to corroborate other data.
Client Account -
Tools – POOR Benford’s Match
Total Records
One
Two
Three
Four
Five
Six
Seven
Eight
Nine
388
145
62
44
35
23
18
23
18
20
Calculated Benford's
37.37%
15.98%
11.34%
9.02%
5.93%
4.64%
5.93%
4.64%
5.15%
Standard Benford's
30.10%
17.61%
12.49%
9.69%
7.92%
6.70%
5.80%
5.12%
4.58%
Diff From Std
7.27%
-1.63%
-1.15%
-0.67%
-1.99%
-2.06%
0.13%
-0.48%
0.58%
Pct Diff From Std
24.14%
-9.25%
-9.23%
-6.92%
-25.13%
-30.71%
2.22%
-9.30%
12.65%
This tools account is a poor Benford’s match.
Aside from actual fraud, there are a couple of reasons
why this analysis didn’t match Benford’s norms.
First is the number of records selected.
There are only 388 records.
In readings, various authors have suggested a variety of
minimum sample sizes for Benford’s to work.
One author suggested that the lowest number of records
permitting a Benford’s analysis was 500.
Other’s suggested the lowest as anywhere from 1,000 to
2,500 to achieve useable results.
In addition to the very small sample size, the nature of
the account might lend itself to repeated purchases of selected
tools that are quickly broken or stolen.
This may suggest that other analytical techniques are
needed as Benford’s doesn’t perform well under these
circumstances.
Client Account -
Materials 1 – FAIR Benford’s Match
Total Records | One | Two | Three | Four | Five | Six | Seven | Eight | Nine | ||
1203 | 393 | 196 | 160 | 87 | 109 | 97 | 65 | 49 | 47 | ||
Calculated Benford's | 32.67% | 16.29% | 13.30% | 7.23% | 9.06% | 8.06% | 5.40% | 4.07% | 3.91% | ||
Standard Benford's | 30.10% | 17.61% | 12.49% | 9.69% | 7.92% | 6.70% | 5.80% | 5.12% | 4.58% | ||
Diff From Std | 2.57% | -1.32% | 0.81% | -2.46% | 1.14% | 1.37% | -0.40% | -1.04% | -0.67% | ||
Pct Diff From Std | 8.52% | -7.48% | 6.45% | -25.37% | 14.43% | 20.44% | -6.83% | -20.37% | -14.62% |
Client Account - Materials 2 – GOOD Benford’s Match
Total Records | One | Two | Three | Four | Five | Six | Seven | Eight | Nine | ||
2137 | 641 | 391 | 267 | 187 | 171 | 147 | 120 | 129 | 84 | ||
Calculated Benford's | 30.00% | 18.30% | 12.49% | 8.75% | 8.00% | 6.88% | 5.62% | 6.04% | 3.93% | ||
Standard Benford's | 30.10% | 17.61% | 12.49% | 9.69% | 7.92% | 6.70% | 5.80% | 5.12% | 4.58% | ||
Diff From Std | -0.11% | 0.69% | 0.00% | -0.94% | 0.08% | 0.18% | -0.18% | 0.92% | -0.65% | ||
Pct Diff From Std | -0.36% | 3.91% | 0.00% | -9.70% | 1.06% | 2.75% | -3.17% | 18.02% | -14.10% |
My final two
examples are the preceding Materials Accounts.
Account Materials 1 with 1203 records provides us
with a Fair Benford’s match while Account Materials 2
with 2137 records provides us with a Good Benford’s match.
As with other analytical techniques, sample size matters.
The larger the sample, the more reliable the results.
Benford’s is
useful, but it isn’t perfect.
It’s designed to detect deviations from expected norms.
It will do so under two conditions.
For a Benford’s analysis to detect an anomaly, a person
that is attempting to perpetrate a fraud has either added
records or removed records in a manner that does not conform to
a Benford distribution.
So, if a transaction was never recorded, like an off the
books fraud, a kickback, a bribe or asset theft, it would not be
detected by Benford’s.
There are other
instances that Benford’s is of little service.
If the data set does not comply with the requirement that
it be “naturally occurring” then Benford’s won’t work.
For example, duplicate addresses or bank accounts cannot
be detected, yet, two employees with similar addresses might
indicate ghost employees or, an employee’s address that is also
a vendor’s address might indicate a shell company.
Other examples where Benford’s is lacking include
duplicate purchase orders or invoice numbers that could signal
duplicate payments, fraud or shell companies.
Additionally, Benford’s will not detect frauds like
contract rigging, defective deliveries, or defective shipments.
I like Benford’s
and the opportunities it presents.
Nevertheless, It’s best used as a corroborating, rather
than as a conclusive tool.
Closing Comments –
This analysis
was derived using Excel.
The files used were very large.
The data was obtained in 68 separate Excel files.
These then had to be consolidated.
Nothing was set up to perform the analysis.
Everything was created from scratch.
It was laborious and time consuming.
A work-a-day auditor
couldn’t spend this much time to get to these results.
A colleague provided the information I used for the
analysis. He spent a
great deal of time obtaining the information from client systems
for another purpose.
Had he not shared his data, this presentation would not have
happened. Just
getting the data would have taken too much time.
There are much
better ways to do this.
Many clients possess and use various forms of ACL
(Auditing Control Language) that provides a mechanism to
directly query and perform a Benford’s analysis on the resulting
data. With such
capability, Benford’s becomes a capable tool providing a
significant resource in identifying and localizing areas of
risk.
The software
firm ACL has introduced a module called
“Find Money Fast”
that incorporates Benford’s analysis as well as other supporting
auditing analytics.
From readings, it seems that the coordinated application of
Benford’s, in addition to other complimentary procedures, is an
effective approach to identifying and localizing auditing risk.
Contact Us