ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	8

Descriptor

Statistical Analysis	9
Scores	6
Achievement Gains	2
Equated Scores	2
Measurement	2
Research Problems	2
Test Items	2
Test Reliability	2
Test Theory	2
Academic Standards	1
Accountability	1
Basic Skills	1
Cognitive Tests	1
College Readiness	1
College Students	1
Computer Assisted Testing	1
Content Analysis	1
Critical Thinking	1
Cutting Scores	1
Decision Making	1
Diagnostic Tests	1
Discourse Analysis	1
Educational Assessment	1
Educational Indicators	1
Educational Policy	1
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	9
Reports - Research	5
Reports - Descriptive	2
Reports - Evaluative	2

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Education	1
Grade 4	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

Canada

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Do 45% of College Students Lack Critical Thinking Skills? Revisiting a Central Conclusion of "Academically Adrift"

Peer reviewed

Direct link

Lane, David; Oswald, Frederick L. – Educational Measurement: Issues and Practice, 2016

The educational literature, the popular press, and educated laypeople have all echoed a conclusion from the book "Academically Adrift" by Richard Arum and Josipa Roksa (which has now become received wisdom), namely, that 45% of college students showed no significant gains in critical thinking skills. Similar results were reported by…

Descriptors: College Students, Critical Thinking, Thinking Skills, Statistical Analysis

The Philosophical Aspects of IRT Equating: Modeling Drift to Evaluate Cohort Growth in Large-Scale Assessments

Peer reviewed

Direct link

Taherbhai, Husein; Seo, Daeryong – Educational Measurement: Issues and Practice, 2013

Calibration and equating is the quintessential necessity for most large-scale educational assessments. However, there are instances when no consideration is given to the equating process in terms of context and substantive realization, and the methods used in its execution. In the view of the authors, equating is not merely an exhibit of the…

Descriptors: Item Response Theory, Equated Scores, Measurement, Educational Assessment

Automated Scoring of Students' Small-Group Discussions to Assess Reading Ability

Peer reviewed

Direct link

Kosh, Audra E.; Greene, Jeffrey A.; Murphy, P. Karen; Burdick, Hal; Firetto, Carla M.; Elmore, Jeff – Educational Measurement: Issues and Practice, 2018

We explored the feasibility of using automated scoring to assess upper-elementary students' reading ability through analysis of transcripts of students' small-group discussions about texts. Participants included 35 fourth-grade students across two classrooms that engaged in a literacy intervention called Quality Talk. During the course of one…

Descriptors: Computer Assisted Testing, Small Group Instruction, Group Discussion, Student Evaluation

Evaluating the Predictive Value of Growth Prediction Models

Peer reviewed

Direct link

Murphy, Daniel L.; Gaertner, Matthew N. – Educational Measurement: Issues and Practice, 2014

This study evaluates four growth prediction models--projection, student growth percentile, trajectory, and transition table--commonly used to forecast (and give schools credit for) middle school students' future proficiency. Analyses focused on vertically scaled summative mathematics assessments, and two performance standards conditions (high…

Descriptors: Prediction, Models, Achievement Gains, Middle School Students

Validating Student Score Inferences with Person-Fit Statistic and Verbal Reports: A Person-Fit Study for Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Cui, Ying; Roberts, Mary Roduta – Educational Measurement: Issues and Practice, 2013

The goal of this study was to investigate the usefulness of person-fit analysis in validating student score inferences in a cognitive diagnostic assessment. In this study, a two-stage procedure was used to evaluate person fit for a diagnostic test in the domain of statistical hypothesis testing. In the first stage, the person-fit statistic, the…

Descriptors: Scores, Validity, Cognitive Tests, Diagnostic Tests

Subscores Based on Classical Test Theory: To Report or Not to Report

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby; Puhan, Gautam – Educational Measurement: Issues and Practice, 2007

There is an increasing interest in reporting subscores, both at examinee level and at aggregate levels. However, it is important to ensure reasonable subscore performance in terms of high reliability and validity to minimize incorrect instructional and remediation decisions. This article employs a statistical measure based on classical test theory…

Descriptors: Test Reliability, Test Theory, Test Validity, Statistical Analysis

A Perspective on the History of Generalizability Theory.

Peer reviewed

Brennan, Robert L. – Educational Measurement: Issues and Practice, 1997

The history of generalizability theory (G theory) is told from the perspective of one researcher's experiences, describing psychometric and scientific perspectives that influenced the development of G theory and its adoption. Work that remains to be done in the field is outlined. (SLD)

Descriptors: Educational Testing, Generalizability Theory, Measurement, Psychometrics

An NCME Instructional Module on Quality Control Procedures in the Scoring, Equating, and Reporting of Test Scores

Peer reviewed

Direct link

Allalouf, Avi – Educational Measurement: Issues and Practice, 2007

There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…

Descriptors: Scoring, Quality Control, Sequential Approach, Error Correction

Allalouf, Avi	1
Almehrizi, Rashid S.	1
Brennan, Robert L.	1
Burdick, Hal	1
Cui, Ying	1
Elmore, Jeff	1
Firetto, Carla M.	1
Gaertner, Matthew N.	1
Greene, Jeffrey A.	1
Haberman, Shelby	1
Kosh, Audra E.	1
Lane, David	1
Murphy, Daniel L.	1
Murphy, P. Karen	1
Oswald, Frederick L.	1
Puhan, Gautam	1
Roberts, Mary Roduta	1
Seo, Daeryong	1
Sinharay, Sandip	1
Taherbhai, Husein	1
More ▼