Asked Nov 22, 2019

Do a small research about Benford’s law. Explain what it is and how it can be used in data science. Keep that in mind that I am not asking to prove Benford’s law. All you need is to explain the law and talk about its application in data science.


Expert Answer

Step 1

Benford’s law is also known as the law of anomalous numbers or the first-digit law. Simon Newcomb is thought to have been the first to discover the phenomenon that is now known as Benford’s law.

Newcomb proposed a law that the probability of a single number N being the first digit of a number was equal to log(d+1) – log(d).

Step 2

Frank Benford is an American physicist who revisited the phenomenon in 1938 which he called the “Law of Anomalous Numbers” in a survey with more than 20,000 observations of empirical data compiled from various resources. All of the sources, to a greater or a lesser extent, followed such an exponentially diminishing distribution.

Hill was the one who gave a satisfactory explanation of this law in 1998.

Step 3

A set of numbers is said to satisfy Benford’s law if the leading ...


Image Transcriptionclose

P(d) log, (d+1)- log, (d) d 1 log 1 log| 1+


Want to see the full answer?

See Solution

Check out a sample Q&A here.

Want to see this answer and more?

Solutions are written by subject experts who are available 24/7. Questions are typically answered within 1 hour.*

See Solution
*Response times may vary by subject and question.
Tagged in



Advanced Topics in Statistics

Related Statistics Q&A

Find answers to questions asked by student like you
Show more Q&A

Q: Suppose that the number of Facebook friends users have is normally distributed with a mean of 112 an...

A: Hey there! Thank you for posting the question. Since your question has more than 3 parts, we are sol...


Q: A research center claims that 31 % of adults in a certain country would travel into space on a comme...

A: The provided information is:


Q: Assume that a sample is used to estimate a population proportion p. Find the 80% confidence interval...

A: The sample proportion is 0.11.The sample size is 242.Consider the confidence interval as 80%, the tw...


Q: This is a statistics class practice question

A: From the given information, the different routes and their total distance are obtained as follows:Ro...


Q: For Confidence interval problems: 1. Mention the name of the interval (for example Z interval or t i...

A: 1. The t interval is used for constructing the confidence interval for the given data. 2. Assumption...


Q: 3.

A: The distribution of demand follows uniform (0, 5)


Q: The following data represent the muzzle velocity​ (in feet per​ second) of rounds fired from a​ 155-...

A: a.From the provided information, the two measurements (A and B) are taken on the same round. Therefo...


Q: To the right are the outcomes that are possible when a couple has three children. Assume that boys a...

A: Given;Total events = 8Number of events that contains exactly three girls = 1The formula to calculate...


Q: I'm confused as to what distrubution I should use to solve this probability. When tall and colorful ...

A: The multinomial distribution can be used if the experiment having more than two outcomes and the out...