The Word Statistics Meaning and Use

The post is about “The Word Statistics Meaning and Use”.

The word statistics was first used by German scholar Gottfried Achenwall in the middle of the 18th century as the science of statecraft concerning the collection and use of data by the state.

The word statistics comes from the Latin word “Status” or Italian word “Statistia” or German word “Statistik” or the French word “Statistique”; meaning a political state, and originally meant information useful to the state, such as information about sizes of the population (human, animal, products, etc.) and armed forces.

itfeature.com The word Statistics

According to pioneer statistician Yule, the word statistics occurred at the earliest in the book “The Element of universal erudition” by Baron (1770). In 1787 a wider definition was used by E.A.W. Zimmermann in “A Political Survey of the Present State of Europe”. It appeared in the Encyclopedia of Britannica in 1797 and was used by Sir John Sinclair in Britain in a series of volumes published between 1791 and 1799 giving a statistical account of Scotland. In the 19th century, the word statistics acquired a wider meaning covering numerical data of almost any subject and also interpretation of data through appropriate analysis.

The Word Statistics Now a Day

Now statistics are being used with different meanings.

  • Statistics refers to “numerical facts that are arranged systematically in the form of tables or charts etc. In this sense, it is always used as a plural i.e. a set of numerical information. For instance statistics on prices, road accidents, crimes, births, educational institutions, etc.
  • The word statistics is defined as a discipline that includes procedures and techniques used to collect, process, and analyze numerical data to make inferences and to reach an appropriate decision in a situation of uncertainty (uncertainty refers to incompleteness, it does not imply ignorance). In this sense word statistic is used in the singular sense. It denotes the science of basing the decision on numerical data.
  • The word statistics refers to numerical quantities calculated from sample observations; a single quantity calculated from sample observations is called statistics such as the mean. Here word statistics is plural.

“We compute statistics from statistics by statistics”

The first place of statistics is plural of statistics, in second place is plural sense data, and in third place is singular sense methods.

In another way, the word Statistics has two meanings:

  • The science of data:
    In this sense, statistics deals with collecting, analyzing, interpreting, and presenting numerical data. Therefore, statistics helps us to understand the world around us by making sense of large amounts of information. Statisticians use a variety of techniques to summarize data, identify patterns, and draw wise conclusions.
  • Pieces of data:
    Statistics also refers to the actual numerical data itself, for example, averages, percentages, or other findings from a study. The real-life examples of statistics are: (i) unemployment statistics or (ii) crime statistics.

Most Common Uses of Statistics

The following are the most common uses of Statistics in various fields of life.

Business and Economics

  • Market Research: Understanding consumer behaviour, satisfaction, preferences, and trends.
  • Operations Management: Optimizing processes, inventory control, and quality control.
  • Financial Analysis: Evaluating investments, risk management, and financial performance.

Healthcare

  • Clinical Trials: Compare and Evaluate the effectiveness and safety of new treatments.
  • Epidemiology: Studying the occurrence and distribution of diseases.
  • Public Health: Identifying health risks and developing prevention strategies.

Social Sciences

  • Sociology: Studying social phenomena, such as inequality, crime, and education.
  • Psychology: Understanding human behaviour, personality, and cognition.
  • Political Science: Analyzing political behaviour, public opinion, and election outcomes.

Government

  • Policy Development: Making informed decisions based on data and evidence.
  • Economic Planning: Forecasting economic growth and trends.
  • Public Administration: Improving efficiency and effectiveness of government services.

Education

  • Educational Research: Evaluating teaching methods, curriculum, and student outcomes.
  • Testing and Assessment: Developing and analyzing standardized tests.
  • Student Data Analysis: Identifying trends and addressing educational disparities.

Science and Technology

  • Research: Designing experiments, collecting data, and analyzing results.
  • Data Analysis: Discovering patterns, relationships, and insights in large datasets.
  • Machine Learning: Developing algorithms that can learn from data and make predictions.

Sports

  • Player Performance Analysis: Evaluating athlete performance and identifying areas for improvement.
  • Team Strategy: Developing game plans and making tactical decisions.
  • Sports Betting: Analyzing data to predict game outcomes.

For learning about the Basics of Statistics Follow the link Basic Statistics

Learn R Language

P value and Significance Level

Difference Between the P value and Significance Level?

Basically in hypothesis testing the goal is to see if the probability value is less than or equal to the significance level (i.e., is p ≤ alpha). It is also called the size of the test or the size of the critical region. It is generally specified before any samples are drawn so that the results obtained will not influence our choice.

p value and significance level

The difference between P Value and Significance Level is

  • The probability value (also called the p-value) is the probability of the observed result found in your research study occurring (or an even more extreme result occurring), under the assumption that the null hypothesis is true (i.e., if the null were true).
  • In hypothesis testing, the researcher assumes that the null hypothesis is true and then sees how often the observed finding would occur if this assumption were true (i.e., the researcher determines the p-value).
  • The significance level (also called the alpha level) is the cutoff value the researcher selects and then uses to decide when to reject the null hypothesis.
  • Most researchers select the significance or alpha level of 0.05 to use in their research; hence, they reject the null hypothesis when the p-value is less than or equal to 0.05.
  • The key idea of hypothesis testing is that you reject the null hypothesis when the p-value is less than or equal to the significance level of 0.05.
https://itfeature.com P-value and statistical significance

Learn about Regression Coefficients

Learn about Weighted Least Squares in R Language

Testing of Hypothesis or Hypothesis Testing Made Easy

To whom is the researcher similar in hypothesis testing: the defense attorney or the prosecuting attorney? Why?

The researcher is similar to the prosecuting attorney in the sense that the researcher brings the null hypothesis “to trial” when she believes there is a probability of strong evidence against the null.

  • Just as the prosecutor usually believes that the person on trial is not innocent, the researcher usually believes that the null hypothesis is not true.
  • In the court system, the jury must assume (by law) that the person is innocent until the evidence calls this assumption into question; analogously, in hypothesis testing the researcher must assume (to use hypothesis testing) that the null hypothesis is true until the evidence calls this assumption into question.
Hypothesis Testing

The world aournd us is complex enough and full of uncertainty. Onlyobserving the data can not tell us if a pattern or relationship exists, or if it is just due to random chance. Therefore, we need hypthesis testing procedure that provides us a systematic method to analyze the sample data and draw conclusions (or make wise decisions) about a larger population, with a clear understanding of the likelihood of being wrong.

In conclusion, like statistical estimation, the statistical hypothesis testing is a cornerstone of statistical analysis. It provides a way to move beyond simply observing data and allows us to draw meaningful inferences about populations, evaluate claims, and make informed decisions in the face of uncertainty.

Testing of Hypothesis

Testing of Hypothesis in R Programming Language

Interpreting Regression Coefficients

Interpreting Regression Coefficients in Multiple Regression

In multiple regression models, for the interpreting regression coefficients, case, the unstandardized multiple regression coefficient is interpreted as the predicted change in $Y$ (i.e., the dependent variable abbreviated as DV) given a one-unit change in $X$ (i.e., the independent variable abbreviated as IV) while controlling for the other independent variables included in the equation.

Interpreting Regression Coefficients in Multiple Regression
  • The regression coefficient in multiple regression is called the partial regression coefficient because the effects of the other independent variables have been statistically removed or taken out (“partially out”) of the relationship.
  • If the standardized partial regression coefficient is being used, the coefficients can be compared for an indicator of the relative importance of the independent variables (i.e., the coefficient with the largest absolute value is the most important variable, the second is the second most important, and so on.)
SPSS Output: Interpreting Regression Coefficients

Interpreting regression coefficients involves understanding the relationship between the IV(s) and the DV in a regression model.

  • Magnitude: The coefficient tells about the change in the DV associated with a one-unit change in the IV, holding all other variables constant. For example, if the regression coefficient for IV (regressor) is 0.5, then it means that for every one-unit increase in that predictor, the DV is expected to increase by 0.5 units while keeping all else equal.
  • Direction: The sign of the regression coefficient (+ or -) indicates the direction of the relationship between the IV and DV. A positive coefficient means that as the IV increases, the DV is expected to increase as well. A negative coefficient means that as the IV increases, the DV is expected to decrease.
  • Statistical Significance: The statistical significance of the coefficient is important to consider. The significance of a regression coefficient tells about whether the relationship between the IV and the DV is likely to be due to chance or if it’s statistically meaningful. Generally, if the p-value of a regression coefficient is less than a chosen significance level (say 0.05), then that coefficient will be considered to be statistically significant.
  • Interaction Effects: The relationship between an IV and the DV may depend on the value of another variable. In such cases, the interpretation of regression coefficients may involve the interaction effects, where the effect of one variable on the DV varies depending on the value of another variable.
  • Context: Always interpret coefficients in the context of the specific problem being investigated. It is quite possible that a coefficient might not make practical sense without considering the nature of the data and the underlying phenomenon being studied.

Therefore, the interpretation of regression coefficients should be done carefully. The assumptions of the regression model, and the limitations of the data, should be considered. On the other hand, interpretation may differ based on the type of regression model being used (e.g., linear regression, logistic regression) and the specific research question being addressed.

statistics help https://itfeature.com

How to interpret Coefficients of Simple Linear Regression Model

Performing Linear Regression Analysis in R Language