ERIC - Search Results

Showing 1 to 15 of 20 results

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

The Multilevel Latent Covariate Model: A New, More Reliable Approach to Group-Level Effects in Contextual Studies

Peer reviewed

Ludtke, Oliver; Marsh, Herbert W.; Robitzsch, Alexander; Trautwein, Ulrich; Asparouhov, Tihomir; Muthen, Bengt – Psychological Methods, 2008

In multilevel modeling (MLM), group-level (L2) characteristics are often measured by aggregating individual-level (L1) characteristics within each group so as to assess contextual effects (e.g., group-average effects of socioeconomic status, achievement, climate). Most previous applications have used a multilevel manifest covariate (MMC) approach,…

Descriptors: Statistical Analysis, Sampling, Context Effect, Simulation

Testing Intergroup Concordance in Ranking Experiments with Two Groups of Judges

Peer reviewed

Dekle, Dawn J.; Leung, Denis H. Y.; Zhu, Min – Psychological Methods, 2008

Across many areas of psychology, concordance is commonly used to measure the (intragroup) agreement in ranking a number of items by a group of judges. Sometimes, however, the judges come from multiple groups, and in those situations, the interest is to measure the concordance between groups, under the assumption that there is some within-group…

Descriptors: Item Response Theory, Statistical Analysis, Psychological Studies, Evaluators

Structural Equation Modeling of Multitrait-Multimethod Data: Different Models for Different Types of Methods

Peer reviewed

Eid, Michael; Nussbeck, Fridtjof W.; Geiser, Christian; Cole, David A.; Gollwitzer, Mario; Lischetzke, Tanja – Psychological Methods, 2008

The question as to which structural equation model should be selected when multitrait-multimethod (MTMM) data are analyzed is of interest to many researchers. In the past, attempts to find a well-fitting model have often been data-driven and highly arbitrary. In the present article, the authors argue that the measurement design (type of methods…

Descriptors: Structural Equation Models, Multitrait Multimethod Techniques, Statistical Analysis, Error of Measurement

Using Explanatory Item Response Models to Analyze Group Differences in Science Achievement

Peer reviewed

Briggs, Derek C. – Applied Measurement in Education, 2008

This article illustrates the use of an explanatory item response modeling (EIRM) approach in the context of measuring group differences in science achievement. The distinction between item response models and EIRMs, recently elaborated by De Boeck and Wilson (2004), is presented within the statistical framework of generalized linear mixed models.…

Descriptors: Science Achievement, Science Tests, Measurement, Error of Measurement

Equipercentile Test Equating: The Effects of Presmoothing and Postsmoothing on the Magnitude of Sample-Dependent Errors.

Fairbank, Benjamin A., Jr. – 1985

The effectiveness of 19 methods of smoothing was investigated as those methods apply to the equipercentile method of test equating. Seven methods involved smoothing the score distribution before the tests were equated (presmoothing). Seven involved smoothing the resultant points after the equating (postsmoothing). Five methods involved combining…

Descriptors: Adults, Equated Scores, Equations (Mathematics), Error of Measurement

Efron's Bootstrap with Some Applications in Psychology.

Lunneborg, Clifford E. – 1983

The wide availability of large amounts of inexpensive computing power has encouraged statisticians to explore many approaches to a basis for inference. This paper presents one such "computer-intensive" approach: the bootstrap of Bradley Efron. This methodology fits between the cases where it is assumed that the form of the distribution…

Descriptors: Analysis of Variance, Error of Measurement, Estimation (Mathematics), Hypothesis Testing
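
As a rough illustration of the resampling idea named in the abstract above, the sketch below estimates the standard error of a sample median by drawing resamples with replacement. It is not taken from Lunneborg's paper; the function name `bootstrap_se`, the example scores, and the choice of 2000 resamples are all assumptions made for illustration.

```python
import random
import statistics


def bootstrap_se(sample, statistic=statistics.median, n_resamples=2000, seed=0):
    """Estimate the standard error of `statistic` by nonparametric bootstrap.

    Hypothetical helper: each resample draws len(sample) values from the
    observed sample with replacement, and the spread of the recomputed
    statistic across resamples approximates its standard error.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(sample)
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sample, k=n)  # sampling with replacement
        estimates.append(statistic(resample))
    return statistics.stdev(estimates)


# Made-up scores, used only to exercise the function.
scores = [12, 15, 9, 20, 17, 14, 11, 18, 16, 13]
se = bootstrap_se(scores)
print(f"median = {statistics.median(scores)}, bootstrap SE = {se:.2f}")
```

The median is used here because it has no simple closed-form standard error, which is the kind of case where a computer-intensive approach is convenient; any other function of the sample could be passed as `statistic`.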

Critical Differences in Aided Sound Field Thresholds in Children.

Peer reviewed

Stuart, Andrew; And Others – Journal of Speech and Hearing Research, 1990

Variability of aided sound field thresholds (ASFTs) was examined in 30 hearing-impaired children comprising 2 age groups (5-9 and 10-14 years). Findings showed that 2 ASFTs would have to differ by more than 10 decibels across signal test frequencies to attain statistical significance. (Author/DB)

Descriptors: Age Differences, Audiology, Auditory Evaluation, Children

Statistical Equating of Direct Writing Assessment.

Phillips, Gary W. – 1985

This paper provides empirical data on two approaches to statistically equate scores derived from the direct assessment of writing. These methods are linear equating and equating based on the general polychotomous form of the Rasch model. Data from the Maryland Functional Writing Test are used to equate scores obtained from two prompts given in…

Descriptors: Elementary Secondary Education, Equated Scores, Equations (Mathematics), Error of Measurement

Lies, Damn Lies, and Statistics Revisited: A Comparison of Three Methods of Representing Change. AIR 1991 Annual Forum Paper.

Pike, Gary R. – 1991

Because change is fundamental to education and the measurement of change assesses the quality and effectiveness of postsecondary education, this study examined three methods of measuring change: (1) gain scores; (2) residual scores; and (3) repeated measures. Data for the study was obtained from transcripts of 722 graduating seniors at the…

Descriptors: Academic Achievement, College Seniors, Error of Measurement, Higher Education

Obtaining Maximum Likelihood Trait Estimates from Number-Correct Scores for the Three-Parameter Logistic Model.

Peer reviewed

Yen, Wendy M. – Journal of Educational Measurement, 1984

A procedure for obtaining maximum likelihood trait estimates from number-correct (NC) scores for the three-parameter logistic model is presented. It produces an NC score to trait estimate conversion table. Analyses in the estimated true score metric confirm the conclusions made in the trait metric. (Author/DWH)

Descriptors: Achievement Tests, Error of Measurement, Estimation (Mathematics), Latent Trait Theory

An Exploration of the Robustness of Four Test Equating Models.

Skaggs, Gary; Lissitz, Robert W. – 1985

This study examined how four commonly used test equating procedures (linear, equipercentile, Rasch Model, and three-parameter) would respond to situations in which the properties of the two tests being equated were different. Data for two tests plus an external anchor test were generated from a three parameter model in which mean test differences…

Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Goodness of Fit

Tests of Variance Equality When Distributions Differ in Form, Scale and Location.

Olejnik, Stephen F.; Algina, James – 1986

Sampling distributions for ten tests for comparing population variances in a two group design were generated for several combinations of equal and unequal sample sizes, population means, and group variances when distributional forms differed. The ten procedures included: (1) O'Brien's (OB); (2) O'Brien's with adjusted degrees of freedom; (3)…

Descriptors: Error of Measurement, Evaluation Methods, Measurement Techniques, Nonparametric Statistics

A Generalizability Study of the Angoff Method Applied to Setting Cutoff Scores of Professional Certification Tests.

Cope, Ronald T. – 1987

This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…

Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement

Adjusting Scores on Examinations Offering a Choice of Questions.

Livingston, Samuel A. – 1986

This paper deals with test fairness regarding a test consisting of two parts: (1) a "common" section, taken by all students; and (2) a "variable" section, in which some students may answer a different set of questions from other students. For example, a test taken by several thousand students each year contains a common multiple-choice portion and…

Descriptors: Difficulty Level, Error of Measurement, Essay Tests, Mathematical Models
