Semester 1: B.Sc., Geology Choice Based Credit System Syllabus 2023-2024
Definition and scope of statistics; tabulation of data; formation of frequency distribution; diagrammatic representation of data: bar diagrams and pie diagrams.
Definition of Statistics
Statistics is the branch of mathematics that deals with collecting, analyzing, interpreting, presenting, and organizing data. It serves as a method for understanding complex phenomena by transforming quantitative observations into meaningful information.
Scope of Statistics
The scope of statistics encompasses various fields including psychology, biology, economics, education, and geology. It aids in making informed decisions, conducting surveys, and predicting outcomes based on data.
Tabulation of Data
Tabulation refers to the systematic arrangement of data in rows and columns to facilitate easier comprehension and analysis. It helps summarize large volumes of data and highlights crucial information through organized layout.
Formation of Frequency Distribution
Frequency distribution is a summary of how often each value occurs in a data set. It groups data into intervals (bins) and counts the number of occurrences in each group, which is essential for understanding the distribution of data.
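As a small illustration (the data values and class limits below are made up for the example), a frequency distribution can be built in a few lines of Python by counting how many observations fall in each class interval:

```python
# Raw observations and class intervals of the form [lower, upper)
data = [12, 15, 21, 9, 18, 25, 14, 30, 22, 17]
bins = [(0, 10), (10, 20), (20, 30), (30, 40)]

# Count the observations falling in each interval
freq = {b: sum(1 for x in data if b[0] <= x < b[1]) for b in bins}
# freq: {(0, 10): 1, (10, 20): 5, (20, 30): 3, (30, 40): 1}
```

The resulting table of interval-to-count pairs is exactly the grouped frequency distribution described above.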
Diagrammatic Representation of Data
Diagrammatic representation involves visually displaying data through charts and diagrams, making it easier to understand and interpret patterns. It enhances analytical insights and communicates information effectively.
Bar Diagrams
Bar diagrams, or bar charts, represent categorical data with rectangular bars, where the length of each bar correlates with the value it represents. They provide a quick visual comparison of different categories.
Pie Diagrams
Pie diagrams, or pie charts, depict the proportion of parts to a whole by dividing a circle into slices. Each slice represents a category's contribution to the overall total, making it simple to visualize relative sizes.
Graphic representation of data: histogram, frequency polygon, and ogives.
Graphic Representation of Data
Histogram
Histograms are graphical representations of the distribution of numerical data. They are created by dividing the data into intervals, known as bins, and displaying the frequency of data points in each bin. The height of each bar represents the frequency of data points within that interval. Histograms are useful for visualizing the shape of a dataset, identifying patterns, and detecting outliers.
Frequency Polygon
A frequency polygon is a graphical tool used to represent the distribution of a dataset similarly to a histogram. It is created by plotting the midpoints of each bin of a histogram and connecting these points with straight lines. Frequency polygons provide a clearer view of the data trends over ranges and are particularly useful for comparing multiple datasets.
Ogives
An ogive is a cumulative frequency graph that represents the cumulative frequency of a dataset at various intervals. It plots the upper class boundaries against the cumulative frequency of each class interval. Ogives help in understanding the distribution of data and in calculating percentiles and medians, making them valuable for statistical analysis.
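The cumulative frequencies that an ogive plots are a running total of the class frequencies. A minimal sketch (with illustrative class boundaries and frequencies) using the standard library:

```python
from itertools import accumulate

# Upper class boundaries and the frequency of each class (illustrative values)
upper_boundaries = [10, 20, 30, 40]
frequencies = [1, 5, 3, 1]

# Running totals: the "less than" cumulative frequencies plotted in an ogive
cumulative = list(accumulate(frequencies))  # [1, 6, 9, 10]
```

Plotting `cumulative` against `upper_boundaries` gives the "less than" ogive; the median can then be read off at half the total frequency.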
Measures of central tendency: arithmetic mean, median, mode, and combined arithmetic mean; their merits and demerits.
Measures of Central Tendency
Measures of central tendency summarize a dataset with a single representative value around which the observations cluster. The principal measures are the arithmetic mean, the median, and the mode.
Arithmetic Mean
The arithmetic mean is the sum of all observations divided by their number. Merits: it uses every observation and lends itself to further algebraic treatment. Demerits: it is strongly affected by extreme values.
Median
The median is the middle value when the observations are arranged in order of magnitude (or the average of the two middle values when the number of observations is even). Merits: it is unaffected by extreme values. Demerits: it ignores the magnitudes of most observations.
Mode
The mode is the value that occurs most frequently in the dataset. Merits: it is easy to identify and is unaffected by extremes. Demerits: it may not exist, or may not be unique.
Combined Arithmetic Mean
When two groups of sizes n1 and n2 have arithmetic means x̄1 and x̄2, their combined arithmetic mean is (n1·x̄1 + n2·x̄2) / (n1 + n2). The formula extends in the same way to any number of groups.
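The three averages and the combined mean can be computed with Python's standard library; the data values and group sizes below are invented for the example:

```python
import statistics

data = [3, 5, 5, 7, 10]
mean_v = statistics.mean(data)      # (3+5+5+7+10)/5 = 6
median_v = statistics.median(data)  # middle value of the sorted data = 5
mode_v = statistics.mode(data)      # most frequent value = 5

# Combined arithmetic mean of two groups:
# group 1 has n1 = 4 observations with mean 10; group 2 has n2 = 6 with mean 15
n1, m1 = 4, 10
n2, m2 = 6, 15
combined = (n1 * m1 + n2 * m2) / (n1 + n2)  # (40 + 90) / 10 = 13.0
```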
Measures of dispersion: absolute and relative measures; range, quartile deviation, mean deviation, and standard deviation.
Measures of Dispersion
Absolute Measures
Absolute measures of dispersion provide a direct quantification of the spread of data points in a dataset. They do not relate to the size of the mean or the overall dataset. Common absolute measures include Range, Quartile Deviation, Mean Deviation, and Standard Deviation.
Range
Range is the simplest measure of dispersion, calculated as the difference between the maximum and minimum values in a dataset. It gives a quick insight into the variability of the data but does not consider the distribution of values.
Quartile Deviation
Quartile Deviation, also known as the semi-interquartile range, is a measure that indicates the spread of the middle 50% of values. It is calculated as half of the difference between the third quartile (Q3) and the first quartile (Q1), i.e. (Q3 - Q1)/2. This measure is useful for understanding the dispersion of data in cases where outliers may skew the results.
Mean Deviation
Mean Deviation is the average of the absolute deviations from the mean of the dataset. It provides a measure of how much the values differ from the average value. This measure accounts for all values in the dataset but may be influenced by outliers.
Standard Deviation
Standard Deviation measures the average distance of each data point from the mean. A low standard deviation indicates that values tend to be close to the mean, while a high standard deviation indicates that values are spread out over a wider range. It is widely used due to its ability to incorporate all data points and its mathematical validity.
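All four absolute measures above can be computed with the standard `statistics` module; the dataset is invented for the example, and note that `statistics.quantiles` uses the "exclusive" quartile convention by default (other conventions give slightly different quartiles):

```python
import statistics

data = [2, 4, 6, 8, 10, 12, 14]

# Range: maximum minus minimum
rng = max(data) - min(data)  # 14 - 2 = 12

# Quartile deviation: (Q3 - Q1) / 2, using the default "exclusive" method
q1, _q2, q3 = statistics.quantiles(data, n=4)
qd = (q3 - q1) / 2  # (12 - 4) / 2 = 4.0

# Mean deviation: average absolute deviation from the mean
mean = statistics.mean(data)  # 8
md = sum(abs(x - mean) for x in data) / len(data)  # 24/7, about 3.43

# Population standard deviation
sd = statistics.pstdev(data)  # sqrt(112/7) = 4.0
```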
Relative Measures
Relative measures of dispersion enable comparison of variability between different datasets or groups. They include coefficients such as Coefficient of Variation, which relates the standard deviation to the mean. This measure allows for comparison between datasets of different units or scales.
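A brief sketch of why the coefficient of variation is useful (the two series are invented so that one is exactly ten times the scale of the other):

```python
import statistics

def coeff_variation(data):
    """Coefficient of variation: population SD as a percentage of the mean."""
    return statistics.pstdev(data) / statistics.mean(data) * 100

a = [10, 12, 14, 16, 18]       # mean 14
b = [100, 120, 140, 160, 180]  # mean 140, ten times the scale of a
# b has a far larger standard deviation than a, yet both series have the
# same relative variability (about 20.2%), which the CV makes comparable.
```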
Curve fitting by the method of least squares: fitting a straight line of the form Y = a + bx and a parabola of the form Y = ax^2 + bx + c; simple problems.
Curve fitting by the Method of Least Squares
Introduction to Curve Fitting
Curve fitting is a statistical technique used to create a curve that best fits a set of data points. It helps in understanding the relationship between variables by approximating the data with a mathematical function.
Method of Least Squares
The Method of Least Squares is a standard approach in regression analysis to minimize the gap between observed and predicted values. It is primarily used to find the best-fitting line or curve by minimizing the sum of the squares of the vertical distances of the points from the curve.
Fitting a Straight Line (Y = a + bx)
In linear regression, we fit a straight line to the data points in the form Y = a + bx, where Y is the dependent variable, x is the independent variable, a is the y-intercept, and b is the slope. The least squares method helps to calculate the values of a and b.
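The normal equations for the straight line have a closed-form solution: b = (nΣxy − ΣxΣy) / (nΣx² − (Σx)²) and a = (Σy − bΣx) / n. A minimal sketch, with data points invented to lie exactly on Y = 2 + 3x:

```python
def fit_line(xs, ys):
    """Least-squares fit of Y = a + bx via the normal equations."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    b = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    a = (sy - b * sx) / n
    return a, b

a, b = fit_line([0, 1, 2, 3], [2, 5, 8, 11])  # recovers a = 2.0, b = 3.0
```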
Fitting a Parabola (Y = ax^2 + bx + c)
When the relationship between the variables is quadratic, we use the equation Y = ax^2 + bx + c to fit a parabola to the data. In this case, a, b, and c are constants determined using the least squares method to minimize the error.
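For the parabola there are three normal equations in a, b, and c, obtained by multiplying Y = ax² + bx + c through by x², x, and 1 and summing. A sketch that solves them directly with Gaussian elimination (the data points are invented to lie exactly on y = (x + 1)²):

```python
def fit_parabola(xs, ys):
    """Least-squares fit of Y = a*x**2 + b*x + c via the normal equations."""
    n = len(xs)
    def p(k):
        return sum(x ** k for x in xs)  # power sum of the x values
    sy = sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sx2y = sum(x * x * y for x, y in zip(xs, ys))
    # Augmented matrix of the three normal equations (unknowns a, b, c)
    m = [
        [p(4), p(3), p(2), sx2y],
        [p(3), p(2), p(1), sxy],
        [p(2), p(1), n,    sy],
    ]
    # Forward elimination with partial pivoting
    for col in range(3):
        piv = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, 3):
            f = m[r][col] / m[col][col]
            m[r] = [v - f * w for v, w in zip(m[r], m[col])]
    # Back-substitution
    sol = [0.0] * 3
    for r in range(2, -1, -1):
        sol[r] = (m[r][3] - sum(m[r][c] * sol[c] for c in range(r + 1, 3))) / m[r][r]
    return sol  # [a, b, c]

a, b, c = fit_parabola([-2, -1, 0, 1, 2], [1, 0, 1, 4, 9])  # a ≈ 1, b ≈ 2, c ≈ 1
```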
Simple Problems and Examples
Applying the methods discussed, consider a dataset with several points. Calculate the parameters a and b for a straight line using the least squares formula, and similarly determine a, b, and c for a quadratic curve, demonstrating the fitting techniques with numeric examples.
Applications of Curve Fitting
Curve fitting is widely used in various fields such as geostatistics, economics, biology, and engineering to analyze trends and make predictions based on empirical data.
Correlation: Karl Pearson's coefficient of correlation; rank correlation; Spearman's rank correlation coefficient.
Correlation and Statistical Methods
Introduction to Correlation
Correlation is a statistical measure that describes the extent to which two variables are linearly related. It indicates the strength and direction of a linear relationship between two quantitative variables.
Karl Pearson's Coefficient of Correlation
Karl Pearson's coefficient of correlation, denoted as r, ranges from -1 to 1. A value of 1 indicates a perfect positive correlation, -1 indicates a perfect negative correlation, and 0 indicates no correlation. It is calculated using the formula: r = cov(X, Y) / (σX * σY), where cov(X, Y) is the covariance between variables X and Y, and σX and σY are the standard deviations of X and Y.
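The formula r = cov(X, Y) / (σX · σY) translates directly into standard-library Python; the data below are invented to lie exactly on the line Y = 2X, so r should come out as 1:

```python
import statistics

def pearson_r(xs, ys):
    """Karl Pearson's r = cov(X, Y) / (sigma_X * sigma_Y), population form."""
    mx, my = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)
    return cov / (statistics.pstdev(xs) * statistics.pstdev(ys))

r = pearson_r([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])  # ≈ 1.0, perfect positive correlation
```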
Properties of Pearson's Correlation Coefficient
Pearson's correlation coefficient has several key properties: it is unitless, always lies between -1 and +1, is sensitive to outliers, and measures only linear relationships. The reliability of an estimated coefficient also depends on the sample size.
Importance of Correlation in Geo-Statistics
In geo-statistics, correlation helps in understanding the relationship between different geological variables, which is crucial for resource exploration and environmental assessment.
Rank Correlation
Rank correlation measures the degree of correspondence between two rankings of data. It provides an alternative to Pearson's correlation when the data do not meet the assumption of normality or are only ordinal.
Spearman's Rank Correlation Coefficient
Spearman's rank correlation coefficient, denoted as ρ (rho), is a non-parametric measure of rank correlation. It assesses how well the relationship between two variables can be described by a monotonic function. The formula involves ranking the data and then calculating the Pearson correlation coefficient on the ranks.
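When there are no tied values, ρ can be computed from the rank differences with the well-known shortcut ρ = 1 − 6Σd² / (n(n² − 1)). A sketch with invented marks for five students in two subjects:

```python
def ranks(values):
    """Rank from 1 (smallest) upward; assumes no tied values."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    r = [0] * len(values)
    for rank, i in enumerate(order, start=1):
        r[i] = rank
    return r

def spearman_rho(xs, ys):
    """Spearman's rho via the rank-difference formula (no ties)."""
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1 - 6 * d2 / (n * (n * n - 1))

# Marks of five students in two subjects (illustrative values)
rho = spearman_rho([56, 75, 45, 71, 62], [66, 70, 40, 60, 65])  # rho = 0.6
```

With ties, the usual practice is to assign average ranks and fall back on Pearson's formula applied to the ranks.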
Applications of Spearman's Rank Correlation
Spearman's rank correlation is particularly useful in geostatistics when dealing with ordinal data or non-normally distributed data, allowing for meaningful inferences about relationships.
Regression: regression equations and their properties.
Regression and Regression Equation: Properties and Contextual Application in Geo-Statistics
Introduction to Regression Analysis
Regression analysis is a statistical method used to examine the relationship between two or more variables. In geology, it helps in understanding how geological factors relate to one another.
Types of Regression
There are various types of regression, including linear regression, multiple regression, logistic regression, and polynomial regression. Each type serves different types of relationships and data sets.
Regression Equation
A regression equation mathematically represents the relationship among variables. In its simplest form for linear regression, it is expressed as Y = a + bX, where Y is the dependent variable, a is the intercept, b is the slope, and X is the independent variable.
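There are in fact two regression lines, Y on X (slope b_yx) and X on Y (slope b_xy), and a standard property links them to correlation: b_yx · b_xy = r². A sketch with invented data, computing both slopes from deviations about the means:

```python
def regression_coefficients(xs, ys):
    """Slopes b_yx (Y on X) and b_xy (X on Y) from deviations about the means."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sxx, sxy / syy

b_yx, b_xy = regression_coefficients([1, 2, 3, 4, 5], [2, 3, 5, 4, 6])
# b_yx ≈ 0.9 and b_xy ≈ 0.9 here, so b_yx * b_xy ≈ 0.81 = r**2
```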
Properties of Regression
Linear regression rests on several key assumptions: 1. Linearity: the relationship between the variables is linear; 2. Independence: the observations are independent; 3. Homoscedasticity: the errors have constant variance; 4. Normality: the errors are normally distributed.
Applications in Geo-Statistics
Regression is widely used in geology for predictive modeling, such as estimating mineral resource potential, analyzing environmental impacts, and assessing geological hazards. It allows for a quantitative evaluation of how geological variables influence one another.
