ERIC - Search Results

Showing all 15 results

The Power and Type I Error of Wilcoxon-Mann-Whitney, Welch's "t," and Student's "t" Tests for Likert-Type Data

Peer reviewed

Simsek, Ahmet Salih – International Journal of Assessment Tools in Education, 2023

The Likert-type item is the most popular response format for collecting data in social, educational, and psychological studies through scales or questionnaires. However, there is no consensus on whether parametric or non-parametric tests should be preferred when analyzing Likert-type data. This study examined the statistical power of parametric and…

Descriptors: Error of Measurement, Likert Scales, Nonparametric Statistics, Statistical Analysis

Assess Robustness of the Rasch Mixture Model to Detect Differential Item Functioning -- A Monte Carlo Study

Jinjin Huang – ProQuest LLC, 2020

Measurement invariance is crucial for an effective and valid measure of a construct. Invariance holds when the latent trait varies consistently across subgroups; in other words, the mean differences among subgroups are only due to true latent ability differences. Differential item functioning (DIF) occurs when measurement invariance is violated.…

Descriptors: Robustness (Statistics), Item Response Theory, Test Items, Item Analysis

A New Statistic for Evaluating Item Response Theory Models for Ordinal Data. CRESST Report 839

Cai, Li; Monroe, Scott – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2014

We propose a new limited-information goodness of fit test statistic $C_2$ for ordinal IRT models. The construction of the new statistic lies formally between the $M_2$ statistic of Maydeu-Olivares and Joe (2006), which utilizes first and second order marginal probabilities, and the $M^*_2$ statistic of Cai and Hansen…

Descriptors: Item Response Theory, Models, Goodness of Fit, Probability

A Revision of School Effectiveness Analysis

Peer reviewed

Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2012

Statistical modeling of school effectiveness data was originally motivated by the dissatisfaction with the analysis of (school-leaving) examination results that took no account of the background of the students or regarded each school as an isolated unit of analysis. The application of multilevel analysis was generally regarded as a breakthrough,…

Descriptors: School Effectiveness, Data Analysis, Statistical Analysis, Statistical Studies

Evidence on the Quality of Several Approximations for Commonly Used Measurement Statistics

Peer reviewed

McMorris, Robert F. – Journal of Educational Measurement, 1972

Approximations were compared with exact statistics obtained on 85 different classroom tests constructed and administered by professors in a variety of fields; means and standard deviations of the resulting differences supported the use of approximations in practical situations. (Author)

Descriptors: Error of Measurement, Measurement Instruments, Reliability, Statistical Analysis

Statistical Methodology in Meta-Analysis.

Hedges, Larry V. – 1982

Meta-analysis has become an important supplement to traditional methods of research reviewing, although many problems must be addressed by the reviewer who carries out a meta-analysis. These problems include identifying and obtaining appropriate studies, extracting estimates of effect size from the studies, coding or classifying studies, analyzing…

Descriptors: Analysis of Variance, Correlation, Error of Measurement, Mathematical Models

Sample Design for Educational Survey Research.

Ross, Kenneth N. – Evaluation in Education: International Progress, 1978

Student's empirical sampling approach is used to assess the magnitude of the sampling errors of statistics describing a recursive causal model. The data were gathered with four complex sample designs commonly used in educational surveys. Jackknife and half-sample error estimates are applied to the data. (Author/CTM)

Descriptors: Error of Measurement, Foreign Countries, Probability, Research Design

Comparative Power of Student T Test and Mann-Whitney U Test for Unequal Sample Size and Variances.

Peer reviewed

Zimmerman, Donald W. – Journal of Experimental Education, 1987

A program obtained random samples from known populations, some of which violated the homogeneity assumption. Student t tests and Mann-Whitney U tests were performed on the sample values. Where the t test led to incorrect decisions, the use of the Mann-Whitney U test in its place led to poorer results. (JAZ)

Descriptors: Computer Software, Error of Measurement, Monte Carlo Methods, Nonparametric Statistics

An Exploration of the Robustness of Four Test Equating Models.

Skaggs, Gary; Lissitz, Robert W. – 1985

This study examined how four commonly used test equating procedures (linear, equipercentile, Rasch Model, and three-parameter) would respond to situations in which the properties of the two tests being equated were different. Data for two tests plus an external anchor test were generated from a three parameter model in which mean test differences…

Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Goodness of Fit

Tests of Variance Equality When Distributions Differ in Form, Scale and Location.

Olejnik, Stephen F.; Algina, James – 1986

Sampling distributions for ten tests for comparing population variances in a two group design were generated for several combinations of equal and unequal sample sizes, population means, and group variances when distributional forms differed. The ten procedures included: (1) O'Brien's (OB); (2) O'Brien's with adjusted degrees of freedom; (3)…

Descriptors: Error of Measurement, Evaluation Methods, Measurement Techniques, Nonparametric Statistics

Relationship Among the Number of Sub-Tests; Skewness, Kurtosis, and Size of Population; And Magnitude of Errors of Estimate in Multiple Matrix Sampling. (Revised Version).

Misanchuk, Earl R. – 1978

Multiple matrix sampling of three subscales of the California Psychological Inventory was used to investigate the effects of four variables on error estimates of the mean (EEM) and variance (EEV). The four variables were examinee population size (600, 450, 300, 150, 100, and 75); number of subtests (2, 3, 4, 5, 6, and 7), hence the number of…

Descriptors: Adults, Analysis of Variance, Error of Measurement, Item Sampling

Adjusting Scores on Examinations Offering a Choice of Questions.

Livingston, Samuel A. – 1986

This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…

Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models

Bias vs. Precision: Combining Estimates in Multisite Evaluation Research.

Bernstein, Lawrence; Burstein, Nancy – 1994

The inherent methodological problem in conducting research at multiple sites is how to best derive an overall estimate of program impact across multiple sites, best being the estimate that minimizes the mean square error, that is, the square of the difference between the observed and true values. An empirical example illustrates the use of the…

Descriptors: Bias, Comprehensive Programs, Data Analysis, Data Collection

Reliability Estimation for Aggregated Data: Applications for Organizational Research.

Hart, Roland J.; Bradshaw, Stephen C. – 1981

This report provides the statistical tools necessary to measure the extent of error that exists in organizational record data and group survey data. It is felt that traditional methods of measuring error are inappropriate or incomplete when applied to organizational groups, especially in studies of organizational change when the same variables are…

Descriptors: Adults, Analysis of Variance, Error of Measurement, Mathematical Formulas

Can Nonexperimental Comparison Group Methods Match the Findings from a Random Assignment Evaluation of Mandatory Welfare-to-Work Programs? MDRC Working Papers on Research Methodology.

Bloom, Howard S.; Michalopoulos, Charles; Hill, Carolyn J.; Lei, Ying – 2002

A study explored which nonexperimental comparison group methods provide the most accurate estimates of the impacts of mandatory welfare-to-work programs and whether the best methods work well enough to substitute for random assignment experiments. Findings were compared for nonexperimental comparison groups and statistical adjustment procedures…

Descriptors: Adult Education, Comparative Analysis, Control Groups, Error of Measurement
