ERIC - Search Results

Showing all 10 results

Evidence-Based Evaluation of Student and Marker Performances in Assessment and Examination

Peer reviewed

Ole J. Kemi – Advances in Physiology Education, 2025

Students are assessed by coursework and/or exams, all of which are marked by assessors (markers). Student and marker performances are then subject to end-of-session board of examiner handling and analysis. This occurs annually and is the basis for evaluating students but also the wider learning and teaching efficiency of an academic institution.…

Descriptors: Undergraduate Students, Evaluation Methods, Evaluation Criteria, Academic Standards

Design of Value-Added Models for IMPACT and TEAM in DC Public Schools, 2010-2011 School Year. Final Report

Isenberg, Eric; Hock, Heinrich – Mathematica Policy Research, Inc., 2011

This report presents the value-added models that will be used to measure school and teacher effectiveness in the District of Columbia Public Schools (DCPS) in the 2010-2011 school year. It updates the earlier technical report, "Measuring Value Added for IMPACT and TEAM in DC Public Schools." The earlier report described the methods used…

Descriptors: Public Schools, Teacher Effectiveness, School Effectiveness, Models

Reliability Analysis for the Internationally Administered 2002 Series GED Tests. GED Testing Service® Research Studies, 2009-3

Setzer, J. Carl; He, Yi – GED Testing Service, 2009

Reliability Analysis for the Internationally Administered 2002 Series GED (General Educational Development) Tests. Reliability refers to the consistency, or stability, of test scores when the authors administer the measurement procedure repeatedly to groups of examinees (American Educational Research Association [AERA], American Psychological…

Descriptors: Educational Research, Error of Measurement, Scores, Test Reliability

Performance Assessments with Microworlds and Their Difficulty

Peer reviewed

Kluge, Annette – Applied Psychological Measurement, 2008

The use of microworlds (MWs), or complex dynamic systems, in educational testing and personnel selection is hampered by systematic measurement errors because these new and innovative item formats are not adequately controlled for their difficulty. This empirical study introduces a way to operationalize an MW's difficulty and demonstrates the…

Descriptors: Personnel Selection, Self Efficacy, Educational Testing, Computer Uses in Education

Review of "How New York City's Charter Schools Affect Achievement"

Reardon, Sean F. – Education and the Public Interest Center, 2009

"How New York City's Charter Schools Affect Achievement" estimates the effects on student achievement of attending a New York City charter school rather than a traditional public school and investigates the characteristics of charter schools associated with the most positive effects on achievement. Because the report relies on an…

Descriptors: Charter Schools, Academic Achievement, Achievement Gains, Achievement Rating

A Comparison of Score Level Estimates of the Standard Error of Measurement.

Peer reviewed

Qualls-Payne, Audrey L. – Journal of Educational Measurement, 1992

Six methods for estimating the standard error of measurement (SEM) at specific score levels are evaluated by comparing score-level SEM estimates from a single test administration with estimates from two test administrations, using Iowa Tests of Basic Skills data for 2,138 examinees. L. S. Feldt's method is preferred. (SLD)

Descriptors: Comparative Testing, Elementary Education, Elementary School Students, Error of Measurement

Equating Scores from Adaptive to Linear Tests

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 2006

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Format, Equated Scores

A Comparison of Unidimensional and Multidimensional IRT Approaches to Test Information in a Test Battery.

Chang, Yu-Wen; Davison, Mark L. – 1992

Standard errors and bias of unidimensional and multidimensional ability estimates were compared in a factorial, simulation design with two item response theory (IRT) approaches, two levels of test correlation (0.42 and 0.63), two sample sizes (500 and 1,000), and a hierarchical test content structure. Bias and standard errors of subtest scores…

Descriptors: Comparative Testing, Computer Simulation, Correlation, Error of Measurement

Confidence in Pass/Fail Decisions for Computer Adaptive and Paper and Pencil Examinations.

Peer reviewed

Bergstrom, Betty A.; Lunz, Mary E. – Evaluation and the Health Professions, 1992

The level of confidence in pass/fail decisions obtained with computerized adaptive tests and paper-and-pencil tests was greater for 645 medical technology students when the computer adaptive test implemented a 90 percent confidence stopping rule than for paper-and-pencil tests of comparable length. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Confidence Testing

Applications of the Analytically Derived Asymptotic Standard Errors of Item Response Theory Item Parameter Estimates

Peer reviewed

Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004

The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…

Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement
