Showing 2,716 to 2,730 of 3,311 results
Harris, Chester W. – 1971
Livingston's work is a careful analysis of what occurs when one pools two populations with different means, but similar variances and reliability coefficients. However, his work fails to advance reliability theory for the special case of criterion-referenced testing. See ED 042 802 for Livingston's paper. (MS)
Descriptors: Analysis of Variance, Criterion Referenced Tests, Error of Measurement, Reliability
Lord, Frederic M.; Stocking, Martha – 1972
A general computer program is described that will compute asymptotic standard errors and carry out significance tests for a wide variety of (standard and) nonstandard large-sample statistical problems, without requiring the statistician to derive asymptotic standard error formulas. The program assumes that the observations have a multinormal…
Descriptors: Bulletins, Computer Programs, Data Processing, Error of Measurement
Peer reviewed
Werts, C. E.; And Others – Educational and Psychological Measurement, 1976
A procedure is presented for the analysis of rating data with correlated intrajudge and uncorrelated interjudge measurement errors. Correlations between true scores on different rating dimensions, reliabilities for each judge on each dimension and correlations between intrajudge errors can be estimated given a minimum of three raters and two…
Descriptors: Correlation, Data Analysis, Error of Measurement, Error Patterns
Tsujimoto, Richard N.; Berger, Dale E. – Child Abuse and Neglect: The International Journal, 1988
Two criteria are discussed for determining cutting scores on a predictor variable for identifying cases of likely child abuse--utility maximizing and error minimizing. Utility maximizing is the preferable criterion, as it optimizes the balance between the costs of incorrect decisions and the benefits of correct decisions. (Author/JDD)
Descriptors: Child Abuse, Cost Effectiveness, Cutting Scores, Error of Measurement
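The two cut-score criteria contrasted in the Tsujimoto and Berger entry can be sketched in a few lines. Everything below (the scores, the cost and benefit values, the `evaluate_cuts` helper) is invented for illustration and is not taken from the article:

```python
# A score >= cut is classified as a predicted case. Misses (fn) are
# costed more heavily than false alarms (fp), as a utility analysis
# of abuse screening plausibly would.

def evaluate_cuts(pos, neg, benefit_tp=1.0, benefit_tn=1.0,
                  cost_fn=-4.0, cost_fp=-1.0):
    """Return (cut, error_count, expected_utility) for each candidate cut.

    pos: predictor scores for true cases; neg: scores for non-cases.
    """
    results = []
    for cut in sorted(set(pos + neg)):
        tp = sum(s >= cut for s in pos)
        fn = len(pos) - tp
        fp = sum(s >= cut for s in neg)
        tn = len(neg) - fp
        utility = (tp * benefit_tp + tn * benefit_tn
                   + fn * cost_fn + fp * cost_fp)
        results.append((cut, fn + fp, utility))
    return results

pos = [3, 5, 6, 7, 8]       # hypothetical scores for later-confirmed cases
neg = [1, 2, 3, 4, 5, 6]    # hypothetical scores for non-cases
res = evaluate_cuts(pos, neg)
error_min_cut = min(res, key=lambda r: r[1])[0]
utility_max_cut = max(res, key=lambda r: r[2])[0]
# With misses costed heavily, the utility-maximizing cut is lower than
# the error-minimizing one: it accepts extra false alarms to catch
# every true case.
```

With these invented numbers the error-minimizing criterion picks cut 5 while the utility-maximizing criterion picks cut 3, which is the paper's point: the two criteria can disagree, and only the utility criterion weighs the asymmetric costs.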
Peer reviewed
Kennedy, Eugene – Journal of Experimental Education, 1988
Ridge estimates (REs) of population beta weights were compared to ordinary least squares (OLS) estimates through computer simulation to evaluate the use of REs in explanatory research. With fixed predictors, there was some question of the consistency of ridge regression, but with random predictors, REs were superior to OLS. (SLD)
Descriptors: Computer Simulation, Error of Measurement, Estimation (Mathematics), Least Squares Statistics
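The ridge-versus-OLS comparison in the Kennedy entry can be reproduced in miniature. This is a minimal sketch, not the study's design: the sample size, ridge constant, noise levels, and true betas below are arbitrary choices, and only the near-collinear random-predictor case is simulated:

```python
import random

random.seed(0)

def estimate(x1, x2, y, k=0.0):
    """Solve the 2x2 normal equations (X'X + kI) beta = X'y directly;
    k = 0 gives the OLS estimate, k > 0 a ridge estimate."""
    s11 = sum(a * a for a in x1) + k
    s22 = sum(a * a for a in x2) + k
    s12 = sum(a * b for a, b in zip(x1, x2))
    b1 = sum(a * c for a, c in zip(x1, y))
    b2 = sum(a * c for a, c in zip(x2, y))
    det = s11 * s22 - s12 * s12
    return ((b1 * s22 - b2 * s12) / det, (b2 * s11 - b1 * s12) / det)

n, reps, k = 30, 200, 1.0
beta_true = (1.0, 1.0)
mse_ols = mse_ridge = 0.0
for _ in range(reps):
    z = [random.gauss(0, 1) for _ in range(n)]      # random predictors
    x1 = [v + random.gauss(0, 0.05) for v in z]     # nearly collinear pair
    x2 = [v + random.gauss(0, 0.05) for v in z]
    y = [a + b + random.gauss(0, 1) for a, b in zip(x1, x2)]  # betas = 1, 1
    e_ols = estimate(x1, x2, y)
    e_rdg = estimate(x1, x2, y, k)
    mse_ols += sum((e - t) ** 2 for e, t in zip(e_ols, beta_true)) / reps
    mse_ridge += sum((e - t) ** 2 for e, t in zip(e_rdg, beta_true)) / reps
# Under collinearity the OLS weights are wildly unstable, so the
# (biased) ridge estimates have far smaller mean squared error.
```

The shrinkage constant k trades a small bias for a large variance reduction; with predictors this collinear the trade is decisively in ridge's favor.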
Peer reviewed
Jarjoura, David; Kolen, Michael J. – Journal of Educational Statistics, 1985
An equating design in which two groups of examinees from slightly different populations are administered a different test form with a subset of common items is widely used. This paper presents standard errors and a simulation that verifies the equation for large samples for an equipercentile equating procedure for this design. (Author/BS)
Descriptors: Computer Simulation, Equated Scores, Error of Measurement, Estimation (Mathematics)
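As background to the Jarjoura and Kolen entry, the basic equipercentile transformation itself can be sketched. This is a simplified single-group version with integer scores; the article's common-item procedure and its standard errors are considerably more involved:

```python
def equipercentile(x, fx, fy):
    """Equipercentile equivalent on form Y of integer score x on form X.

    fx, fy: frequency lists indexed by score. Each score is treated as
    an interval of width 1 (the usual continuization convention).
    """
    nx, ny = sum(fx), sum(fy)
    # Percentile rank of x on form X (midpoint convention).
    p = 100.0 * (sum(fx[:x]) + 0.5 * fx[x]) / nx
    # Invert the form-Y distribution at the same percentile rank.
    target = p * ny / 100.0
    cum = 0.0
    for y, f in enumerate(fy):
        if f > 0 and cum + f >= target:
            return y - 0.5 + (target - cum) / f
        cum += f
    return len(fy) - 0.5

# Identical forms equate to themselves; a form shifted one point
# harder equates one point higher.
same = equipercentile(2, [1, 2, 3, 2, 1], [1, 2, 3, 2, 1])        # 2.0
shifted = equipercentile(2, [1, 2, 3, 2, 1], [0, 1, 2, 3, 2, 1])  # 3.0
```

The standard errors the article derives describe the sampling variability of exactly this kind of equated score when the two frequency distributions are estimated from different examinee groups.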
Peer reviewed
Schaeffer, Gary A.; And Others – Evaluation Review, 1986
The reliability of criterion-referenced tests (CRTs) used in health program evaluation can be conceptualized in different ways. Formulas are presented for estimating appropriate standard error of measurement (SEM) for CRTs. The SEM can be used in computing confidence intervals for domain score estimates and for a cut-score. (Author/LMO)
Descriptors: Accountability, Criterion Referenced Tests, Cutting Scores, Error of Measurement
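One common single-administration SEM for a domain score, and the confidence interval built from it, can be sketched as follows. This assumes the binomial error model; the Schaeffer et al. article derives several alternative formulas, and the example numbers are invented:

```python
import math

def sem_binomial(x, n):
    """SEM of the proportion-correct domain score x/n under the
    binomial error model (one standard approximation)."""
    p = x / n
    return math.sqrt(p * (1 - p) / (n - 1))

def domain_score_ci(x, n, z=1.96):
    """Approximate 95% confidence interval for the domain score."""
    p, s = x / n, sem_binomial(x, n)
    return (p - z * s, p + z * s)

lo, hi = domain_score_ci(32, 40)   # examinee answered 32 of 40 correctly
# The interval is roughly (0.67, 0.93). A cut score of 0.70 falls
# inside it, so mastery at that cut cannot be asserted with confidence
# for this examinee.
```

This is the practical use the abstract names: comparing the interval around an examinee's domain-score estimate with the program's cut score.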
Peer reviewed
Rogosa, David R.; Willett, John B. – Journal of Educational Measurement, 1983
The results of this study indicate that the difference score is often highly reliable when the correlation between true change and true initial status is nonnegative. In general, when individual differences in true change are appreciable, the difference score shows strong…
Descriptors: Achievement Gains, Error of Measurement, Individual Differences, Measurement Techniques
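The Rogosa and Willett argument can be illustrated with the classical formula for the reliability of a difference score. The formula is standard textbook psychometrics consistent with their framework, but the numerical values below are invented:

```python
def diff_score_reliability(var1, var2, rel1, rel2, r12):
    """Classical reliability of the difference score D = X2 - X1.

    var1, var2: observed-score variances of the two measurements;
    rel1, rel2: their reliabilities; r12: their correlation.
    """
    cov = r12 * (var1 * var2) ** 0.5
    return (var1 * rel1 + var2 * rel2 - 2 * cov) / (var1 + var2 - 2 * cov)

# When pre- and post-scores correlate weakly (i.e., individual
# differences in change are appreciable), the difference score is
# reasonably reliable; when they correlate strongly, it is not.
low_corr = diff_score_reliability(1.0, 1.0, 0.8, 0.8, 0.3)   # ~0.71
high_corr = diff_score_reliability(1.0, 1.0, 0.8, 0.8, 0.7)  # ~0.33
```

The often-cited "unreliability of gain scores" corresponds to the high-correlation case; the article's point is that the low-correlation case is common in practice.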
Peer reviewed
Lord, Frederic M. – Journal of Educational Measurement, 1984
Four methods are outlined for estimating or approximating from a single test administration the standard error of measurement of number-right test score at specified ability levels or cutting scores. The methods are illustrated and compared on one set of real test data. (Author)
Descriptors: Academic Ability, Cutting Scores, Error of Measurement, Scoring Formulas
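One of the simplest single-administration approximations of the kind the Lord entry compares is the binomial-error conditional SEM. This sketch assumes that model and is not necessarily the method the article prefers:

```python
import math

def conditional_sem(x, n):
    """Conditional SEM of number-right score x on an n-item test under
    the binomial error model (a single-administration approximation)."""
    return math.sqrt(x * (n - x) / (n - 1))

# On a 50-item test the SEM is largest mid-range and shrinks toward
# the extremes, so the error band at a cutting score depends on where
# the cut falls.
mid = conditional_sem(25, 50)    # 25/7, about 3.57 points
high = conditional_sem(45, 50)   # 15/7, about 2.14 points
```

This score-level dependence is why a single overall SEM can misstate the measurement error at a particular cutting score.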
Sykes, Robert C.; Hou, Liling; Hanson, Brad; Wang, Zhen – 2002
This study investigated the effect on student scores of using anchor sets that differed in dimensionality in item response theory (IRT) scaled tests. Real data from a mathematics achievement test that had been documented to have dimensions aligned with item format were used. Item responses were available from a representative sample of…
Descriptors: Elementary School Students, Equated Scores, Error of Measurement, Intermediate Grades
Yi, Qing; Wang, Tianyou; Ban, Jae-Chun – 2000
Error indices (bias, standard error of estimation, and root mean square error) obtained on different scales of measurement under different test termination rules in a computerized adaptive test (CAT) context were examined. Four ability estimation methods were studied: (1) maximum likelihood estimation (MLE); (2) weighted likelihood estimation…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Error of Measurement
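The three error indices named in the Yi, Wang, and Ban entry are standard simulation summaries and can be computed in a few lines. The ability estimates below are invented replication values against a known true theta, not CAT output:

```python
import math

def error_indices(estimates, theta):
    """Bias, standard error, and root mean square error of a set of
    ability estimates against a known true theta."""
    n = len(estimates)
    mean = sum(estimates) / n
    bias = mean - theta
    se = math.sqrt(sum((e - mean) ** 2 for e in estimates) / n)
    rmse = math.sqrt(sum((e - theta) ** 2 for e in estimates) / n)
    return bias, se, rmse

bias, se, rmse = error_indices([0.9, 1.1, 1.3], 1.0)
# The three indices are linked by the identity rmse^2 = bias^2 + se^2,
# which is why studies report all three: rmse mixes systematic and
# random error, which bias and se separate.
```

Because the indices are computed on a particular score scale, their values change under rescaling, which is the scale-of-measurement issue the study examines.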
Peer reviewed
Livingston, Samuel A. – Journal of Educational Measurement, 1972
This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction
Peer reviewed
Kristof, Walter – Psychometrika, 1971
Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing
Peer reviewed
Novick, Melvin R.; And Others – Psychometrika, 1971
Descriptors: Analysis of Variance, Bayesian Statistics, Error of Measurement, Mathematical Models
Peer reviewed
Haladyna, Thomas M.; Roid, Gale H. – Journal of Educational Measurement, 1983
The present study showed that Rasch-based adaptive tests--when item domains were finite and specifiable--had greater precision in domain score estimation than test forms created by random sampling of items. Results were replicated across four data sources representing a variety of criterion-referenced, domain-based tests varying in length.…
Descriptors: Adaptive Testing, Criterion Referenced Tests, Error of Measurement, Estimation (Mathematics)