Unit 6.1 - Introduction to Statistics
Introduction to Statistics
Statistics is the discipline
(some consider branch of mathematics) that concerns the collection, organization,
analysis, interpretation and presentation of data.
Data: Set of information (any fact, figure, sign,
symbol, number) describes a given entity
Data Types: Qualitative data, Quantitative data
Data Analysis
- Descriptive Statistics: Quantitatively describes or summarizes features of collected information using table, graphs and calculated using central tendency, dispersion, index number. Generalization of data is not possible, only provide worthy information regarding the nature of a specific group of individuals.
- Inferential Statistics: Learn about the population through appropriate sample of data using probability, estimation, different statistical test. It facilitates the generalization of data.
Definitions of Statistics
"Methods for making wise decisions on the
face of uncertainty.” —Wallis and Roberts
"The science of
quantitative reasoning" — Stephen Senn
Singular Sense (Process)
In
the singular sense, statistics refers to the "Methodology and principles
of Statistical Analysis" or "Stages of Investigations".
For
example: Investigation:
Distribution of student's weights in DAV.
- First: Information collection (college records, or directly from student)
- Second: Presentation using table & graph (Arrangement of weights in various groups such as “50 Kg to 60 Kg”, “60 Kg to 70 Kg” )
- Third: Analysis (computing averages or some other measures)
- Fourth: Interpretation (either there is obesity or not)
All this involves a procedure and a
method from the primary stage to the final stage of analysis and conclusions.
"Science of averages”,
"Science of Counting", "Science of estimates and
probability"
“Statistics may be defined as the collection,
presentation, analysis, and interpretation of numerical data.” —Croxton and
Cowden
Methodology of Statistical Analysis or Stages of Investigations
- 1st: Collection of data from the field of inquiry using different methods like primary and secondary, census and sampling etc.
- 2nd: Organization (classification) of data in a systematic manner according to date, time, place
- 3rd: Presentation of data to make simple and attractive using table and graphs
- 3rd: Analysis of data using various tools like average, co-relation, regression, dispersion, etc.
- 4th: Interpretation of data to draw the conclusion, to make comparison, to forecast
Plural Sense (Feature)
In
the plural sense, statistics refers to the "characteristics or features or
quality" contained in it.
"All statistics
are numerical statements of facts, but all facts numerically stated are not
statistics".
“By statistics we mean
aggregates of facts affected to a marked extent by multiplicity of causes,
numerically expressed, enumerated or estimated according to reasonable standard
of accuracy, collected in a systematic manner for a predetermined purpose, and
placed in relation to each other.”—Horace Sacrist
Features of Statistics
1.
Aggregate of Facts: single and unconnected
numerical figure has no meaning.
2. Numerical Expression: Countable quantity only
not the qualitative phenomena like honesty, goodness, ability, beauty etc. But if we assign numerical expression, it may be
described as ‘statistics’.
3. Systematic Collection: By applying suitable
methods and procedure (rough and haphazard method is not
desirable)
4. Pre-determined purpose: Must have a
well-defined purpose, specific aims, and objectives before we collect data
5. Comparable: To make a valid comparison, the data
should be homogeneous. For e. g., passed number of students of DAV and other
College constitute statistics but weight of the students & their intelligence
does not constitute statistics.
6. Affected
by Multiplicity of Causes: In social sciences, there may be combined effect of multiple
causes. For example, deterioration of academic achievement score of some students
may not be only due to lack of interest in school subjects, but may also due to
lack of motivation, effective teaching methods, attitude of the students on
school subjects etc.
7. Level
of Accuracy: No
any unique standard of accuracy, it differs from enquiry to enquiry or from purpose to
purpose and availability of time and resources. For e. g. we can ignore ten
deaths in a country but we cannot ignore even a single death in a family.
Scope or Importance of Statistics
in Different Fields
In ancient times, statistics was regarded
as the science of statecraft and was used to collect information related to
military force, population, wealth, etc. But in modern times, the use of
statistics is broad.
Statistics plays a vital role in
every field of human activity and holds a central position in almost every
field, including industry, commerce, trade, physics, chemistry, economics,
mathematics, biology, botany, psychology, astronomy, etc., so the application
of statistics is very wide.
- In Business: To take accurate managerial decision for businessman by analyzing risk & uncertainty, taste & preference of the consumer, quality of the product, location of business, marketing, financial resource management, production possibility, investment & output
- In Mathematics: The large number of statistical methods like probability, averages, dispersion, estimation etc. are used in mathematics. So, advancement in statistical techniques are the outcome of wide applications of mathematics.
- In Banking: Liquidity management, revenue forecasting, impact analysis of planning (IMPLAN), interest rate determination
- In state management (Administration): Formulation of policy and plan or administrative decision of the government based on statistics. For e. g. in budget formulation (tax & subsidy), in fiscal policy formulation (economic policy, industrial policy, trade policy, wage policy)
- In Accounting & Auditing: Accounting is not possible without exactness. But for decision making process, so much precision is not essential. So, decision can be made on approximation using accounting results (balance sheet, CFS, P&L statement) as statistics. Auditing process may apply sampling techniques to draw conclusion.
Importance of Statistics in Economics
“Without an adequate understanding of the statistical methods, the
investigators in the social sciences may be like the blind man groping in a
dark room for a black cat that is not there."
Croxton & Cowden--
Statistical data and techniques of statistical analysis are immensely useful (irreplaceable tool) in economics. Economics uses statistical significance often to get necessary facts and figures.
Statistics
and Economics are interrelated with each other. Almost every branch of
Economics uses statistics i.e., consumption, production, distribution,
exchange, public finance. All these Economic branches use statistics for
comparison, presentation, interpretation, and so on.
1. Quantitative
Expression of Economic Problems and activities: Poverty, unemployment, economic inequality, inflation,
NIA, BOT (X&M), BOP, business cycle.
2. Comparisons: inter-sectoral comparison (agriculture, tourism,
hydroelectricity), inter-temporal comparisons (over different plan periods
of the rural and urban unemployment)
3.
Cause
and Effect Relationship: price
& demand, GDP growth rate & poverty level, migration & imports
4.
Economic
Equilibrium: To understand the
behavior of the producer and consumer in
the market by equilibrium condition.
For Producer – max. profit and for Consumer – max. satisfaction
5.
Developing
Economic Theories: To
understand, analyze and develop (Law of demand, law of supply, theory of
production, theory of consumer behavior, quantity theory of money, theory of
intl' trade)
6.
Forecasting: Statistical data and tool like time series analysis,
estimation and probability
are useful to forecast the changes
in economic variables. This information enables economists or government
to formulate polices and suggestions to overcome the problem.
7.
Formulation
of economic policies: Such as fiscal
policies (budget), trade policy, industrial policy and monetary policies are
determined by the help of statistics.
Functions of Statistics
1. Present Facts in Definite Form: Vague idea or data into definite and precise form with the help of statistical tools and techniques. For e. g. in Nepal, Population and poverty is increasing in faster rate (Idea) - 1.35% and 18.7 (Fact)
2. Condensation of Data: Simplifying complex data to simple to make them understandable with the help of graph, diagram or through an average etc. For e. g. general price level, PCI.
3. Comparisons: Data can be correlated as well as compared by using average, ratio, coefficients etc.
4. Formulation and Testing of Hypothesis and Develop New Theory: For e. g. (1) effect of imposing tax on the exports of tea on the consumption of tea in other countries, (2) whether hydroxychloroquine is effective in preventing COVID-19, (3) whether students have been benefited from the online classes during lock-down etc., can be tested by appropriate statistical tools.
5. Forecasting: To formulate suitable plan and policies for future, statistics provide tools to analyze trend, tendencies, pattern and nature of data. For e. g. demand forecasting, business expansion plan.
6. Draw Valid Inferences: With the measure of reliability about the population parameters from the sample data.
2. Condensation of Data: Simplifying complex data to simple to make them understandable with the help of graph, diagram or through an average etc. For e. g. general price level, PCI.
3. Comparisons: Data can be correlated as well as compared by using average, ratio, coefficients etc.
4. Formulation and Testing of Hypothesis and Develop New Theory: For e. g. (1) effect of imposing tax on the exports of tea on the consumption of tea in other countries, (2) whether hydroxychloroquine is effective in preventing COVID-19, (3) whether students have been benefited from the online classes during lock-down etc., can be tested by appropriate statistical tools.
5. Forecasting: To formulate suitable plan and policies for future, statistics provide tools to analyze trend, tendencies, pattern and nature of data. For e. g. demand forecasting, business expansion plan.
6. Draw Valid Inferences: With the measure of reliability about the population parameters from the sample data.
"The fundamental gospel of statistics is to push back the domain of ignorance, rule of thumb, arbitrary or premature decisions, traditions and dogmatism and to increase the domain in which decisions are made and principles are formulated on the basis of analyzed quantitative facts"
– Robert W. Burgess
Limitations of Statistics
"There are three kinds of lies: lies, damned lies, statistics" - Benjamin Disraeli
"Statistics does not have any label for their quality". - Wilford I. King
Studying the boundaries of statistics helps to be aware of possible mistakes.
(1) Ignores Qualitative Phenomenon: A qualitative phenomenon like empowerment, leadership, honesty, character, culture, love, care, friendship, emotional impact of racial discrimination, etc., can’t be expressed numerically or direct statistical analysis. However, that phenomenon can be expressed indirectly of numbers and analyzed in statistics. But translated poor qualitative data into quantitative form may not always valid and provide faulty results.
(2) Results are True Only on Average: “Statistics largely deals with averages and these
averages may be made up of individual items radically different from each
other.” —Wilford I. King
For e.g. (a) If the average grade of two sections of students is same, it doesn’t mean that all the students in section A got the same marks as all the students in section B.(b) If it is said that Nepalese are poor, then this statement points to a trend. This does not mean that there is no rich in Nepal.
(3) Liable to be Misused: “Statistics are like clay of which you can make a ‘God’ or a ‘Devil’ as you please.” - Wilford I. King (Chances of misuse)
“The
science of statistics is a useful servant but only of great value to
those who understand its proper use.” -
Wilford I. King
(Knowledge required)
Data may have been collected by inexperienced persons or they may have been dishonest or biased. The validity of certain statistical methods depends on the nature of data, the levels of measurement, the knowledge of the pertinent aspects of the situations and lastly, the assumptions made vis-a-vis the sample from which the data are secured. The researcher may be tempted to twist results to suit his fond conclusions or hypotheses by applying very sensitive tools. So, data must be used with a caution. Otherwise results may prove to be disastrous.
(4) To Many Methods to Study Problems: “It must not be assumed that statistics is the only method to use in research, neither should this method be considered the best attack for the problem.” – Frederick Croxten and Dudley Cowden
(5) Its Laws are Not Exact: Although the law of inertia of large numbers and law of statistical regularity are the fundamental laws of statistics, these laws do not give definite results like other scientific laws. Results from statistics may not be the final ‘truth’ because statistical analysis rely on approximation and probability but not in terms of certainty. For example, according to the law of statistics, if we roll a dice 60 times, we are likely to get the number 3 ten times, but this isn’t a guarantee.
"Statistics does not have any label for their quality". - Wilford I. King
Studying the boundaries of statistics helps to be aware of possible mistakes.
(1) Ignores Qualitative Phenomenon: A qualitative phenomenon like empowerment, leadership, honesty, character, culture, love, care, friendship, emotional impact of racial discrimination, etc., can’t be expressed numerically or direct statistical analysis. However, that phenomenon can be expressed indirectly of numbers and analyzed in statistics. But translated poor qualitative data into quantitative form may not always valid and provide faulty results.
For e.g. (a) If the average grade of two sections of students is same, it doesn’t mean that all the students in section A got the same marks as all the students in section B.(b) If it is said that Nepalese are poor, then this statement points to a trend. This does not mean that there is no rich in Nepal.
(3) Liable to be Misused: “Statistics are like clay of which you can make a ‘God’ or a ‘Devil’ as you please.” - Wilford I. King (Chances of misuse)
Data may have been collected by inexperienced persons or they may have been dishonest or biased. The validity of certain statistical methods depends on the nature of data, the levels of measurement, the knowledge of the pertinent aspects of the situations and lastly, the assumptions made vis-a-vis the sample from which the data are secured. The researcher may be tempted to twist results to suit his fond conclusions or hypotheses by applying very sensitive tools. So, data must be used with a caution. Otherwise results may prove to be disastrous.
(4) To Many Methods to Study Problems: “It must not be assumed that statistics is the only method to use in research, neither should this method be considered the best attack for the problem.” – Frederick Croxten and Dudley Cowden
(5) Its Laws are Not Exact: Although the law of inertia of large numbers and law of statistical regularity are the fundamental laws of statistics, these laws do not give definite results like other scientific laws. Results from statistics may not be the final ‘truth’ because statistical analysis rely on approximation and probability but not in terms of certainty. For example, according to the law of statistics, if we roll a dice 60 times, we are likely to get the number 3 ten times, but this isn’t a guarantee.
Statistical Tools Used in Economic Analysis
"Statistics
are the straw out of which I, like every other economist, have to make
bricks."
Alfred Marshall, 1980–
1.
Collection of data (Primary or secondary)
2.
Editing
3.
Classification and tabulation
4. Tools
of presentation: Diagrams and Graphs (Of various types)
5.
Measures of Central Tendency: Mean, Mode, Median, G.M, H.M to find average
6.
Measures of Dispersion: Q.D, M.D., S.D. to find variability of an
individual item from central value.
7.
Skewness and Kurtosis –
(a) Skewness: Skewness is asymmetry in statistical distribution, in which the curve appears distorted or skewed either to the left or to the right.
Skewed Distribution
Symmetrical
Distribution: Mean = Median = Mode
Positive
Skewed Distribution: Mean > Median > Mode
Negative
Skewed Distribution: Mean <Median < Mode
(b) Kurtosis: Kurtosis is the degree of flatness or peaked-ness
in the region of mode of a frequency curve.
8.
Correlation and regression:
(a)
Correlation: Correlation (dependence) shows whether and how strongly pairs of
variables are related (causal or not).
(b) Regression: Regression attempts to determine the strength and
character of the relationship between one dependent variable and a series of
other independent variables.
Simple linear regression: Y = a + bX + u
- Y = the variable
that you are trying to predict (dependent variable).
- X = the variable
that you are using to predict Y (independent variable).
- a = the intercept.
- b = the slope.
- u = the regression
residual.
QdX = f (Px) 👉 QdX = a - b PX
9.
Index numbers:
An index number is the measure of change in
a variable (or group of variables) over time. It is
typically used in economics to measure trends in a wide variety of areas
including: stock market prices, cost of living, industrial or agricultural
production, and imports. Laspeyres, Paasche, Fishers' index number are popular
methods to calculate index number.
10.
Time Series Analysis:
It deals with time series data or trend analysis. Time series data means that
data is in a series of particular time periods or intervals.
11. Probability: In ordinary language the
term probability refers to the chance of happening or not happening
of an event. 'Chance' refers to uncertainty. In economics, it is used as risk
analysis to predict the future value.
View Video:
Download PDF: Introduction to Statistics
Comments
Post a Comment
If you have any doubt, Please let me know !