Showing all 7 results
Peer reviewed
Zesch, Torsten; Horbach, Andrea; Zehner, Fabian – Educational Measurement: Issues and Practice, 2023
In this article, we systematize the factors influencing performance and feasibility of automatic content scoring methods for short text responses. We argue that performance (i.e., how well an automatic system agrees with human judgments) mainly depends on the linguistic variance seen in the responses and that this variance is indirectly influenced…
Descriptors: Influences, Academic Achievement, Feasibility Studies, Automation
Peer reviewed
Sireci, Stephen G.; Suárez-Álvarez, Javier; Zenisky, April L.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2024
The goal in personalized assessment is to best fit the needs of each individual test taker, given the assessment purposes. Design-in-Real-Time (DIRTy) assessment reflects the progressive evolution in testing from a single test, to an adaptive test, to an adaptive assessment "system." In this article, we lay the foundation for DIRTy…
Descriptors: Educational Assessment, Student Needs, Test Format, Test Construction
Peer reviewed
Liu, Jinghua; Dorans, Neil J. – Educational Measurement: Issues and Practice, 2013
We make a distinction between two types of test changes: inevitable deviations from specifications versus planned modifications of specifications. We describe how score equity assessment (SEA) can be used as a tool to assess a critical aspect of construct continuity, the equivalence of scores, whenever planned changes are introduced to testing…
Descriptors: Tests, Test Construction, Test Format, Change
Peer reviewed
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Peer reviewed
Eignor, Daniel R. – Educational Measurement: Issues and Practice, 2008
This article discusses a particular type of concordance table and the potential for test score misuse that may result from employing such a table. The concordance that is discussed is typically created between scores on different, nonequatable versions of a test that share the same or close to the same test title. These concordance tables often…
Descriptors: Scores, Tables (Data), Comparative Analysis, Equated Scores
Peer reviewed
Frey, Andreas; Hartig, Johannes; Rupp, Andre A. – Educational Measurement: Issues and Practice, 2009
In most large-scale assessments of student achievement, several broad content domains are tested. Because more items are needed to cover the content domains than can be presented in the limited testing time to each individual student, multiple test forms or booklets are utilized to distribute the items to the students. The construction of an…
Descriptors: Measures (Individuals), Test Construction, Theory Practice Relationship, Design
Peer reviewed
Wainer, Howard – Educational Measurement: Issues and Practice, 1999
Discusses the comparison of groups of individuals who were administered different forms of a test. Focuses on the situation in which there is little overlap in content between the test forms. Reviews equating problems in national tests in Canada and Israel. (SLD)
Descriptors: Comparative Analysis, Equated Scores, Foreign Countries, National Competency Tests