ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	8

Descriptor

Educational Testing	24
Models	24
Test Validity	12
Validity	11
Test Construction	7
Evaluation Methods	6
Reliability	5
Student Evaluation	5
Academic Achievement	4
Computer Assisted Testing	4
Educational Assessment	4
Program Effectiveness	4
Test Interpretation	4
Academic Ability	3
Achievement Tests	3
Adaptive Testing	3
High Stakes Tests	3
Item Response Theory	3
Measurement Techniques	3
Psychometrics	3
Standardized Tests	3
Test Bias	3
Test Reliability	3
Testing Programs	3
Community Colleges	2
More ▼

Source

Journal of Educational…	2
ProQuest LLC	2
Educational Assessment	1
Educational Research	1
Educational Research and…	1
Journal of Applied Research…	1
Journal of Experimental…	1
Measurement:…	1
Multivariate Behavioral…	1
Online Submission	1
Regional Educational…	1
School Psychology Digest	1
Sociology of Education	1
Teacher	1
Theory and Research in…	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	6
Reports - Evaluative	5
Reports - Descriptive	4
Information Analyses	3
Opinion Papers	3
Collected Works - Proceedings	2
Dissertations/Theses -…	2
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	3
Adult Education	2
High Schools	1
Postsecondary Education	1

Audience

Researchers

Location

Canada	1
Florida	1
Louisiana	1
Minnesota	1
New York	1
North Carolina	1
Tennessee	1
Texas	1
United Kingdom	1
Utah	1
Wisconsin	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Alberta Grade Twelve Diploma…	1
Comprehensive Tests of Basic…	1
Graduate Record Examinations	1
System of Multicultural…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

Peer reviewed

Direct link

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

Descriptors: Educational Testing, Scores, Reports, Psychometrics

Controlling Type I Error Rate in Evaluating Differential Item Functioning for Four DIF Methods: Use of Three Procedures for Adjustment of Multiple Item Testing

Direct link

Kim, Jihye – ProQuest LLC, 2010

In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present and high possibility of making such an error can weaken the validity of the assessment.…

Descriptors: Test Bias, Test Length, Simulation, Testing

A Mixture-Modeling Approach to Exploring Test-Taking Motivation in Large-Scale Low-Stakes Contexts

Direct link

Horst, S. Jeanne – ProQuest LLC, 2010

Despite high-stakes applications of assessment findings, assessment data are frequently collected in situations that are of low-stakes to examinees. Because low-stakes tests are of little consequence to the examinees, test-taking motivation and thus the validity of inferences drawn from unmotivated examinees' scores are of concern. The current…

Descriptors: Personality Traits, Motivation, Personality, Data Analysis

Advantages of the Rasch Measurement Model in Analysing Educational Tests: An Applicator's Reflection

Peer reviewed

Direct link

Tormakangas, Kari – Educational Research and Evaluation, 2011

Educational achievement is a very important issue for parents, teachers, and the government. An accurate measurement plays a very important role in evaluating achievement fairly, and, therefore, analysis methods have been developed considerably in recent years. Education based on long-time learning processes forms a fruitful base for item tests,…

Descriptors: Test Items, Item Analysis, Learning Processes, Item Response Theory

Evidence Based Education Request Desk. EBE #500

Peer reviewed
PDF on ERIC

Download full text

Regional Educational Laboratory Southeast, 2009

Since the passage of the No Child Left Behind Act of 2001 (2002), there has been increased interest in using student achievement data (through standardized tests) to evaluate teacher effectiveness. Two U.S. Department of Education secretaries, Secretary Spellings and Secretary Duncan, have expressed interest in growth models and the need to…

Descriptors: Evidence, Educational Research, Teacher Effectiveness, Teacher Evaluation

How Much Can We Reliably Know about What Examinees Know?

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.

Descriptors: Scoring, Reliability, Validity, Classification

Model-Based Assessments to Support Learning and Accountability: The Evolution of CRESST's Research on Multiple-Purpose Measures

Peer reviewed

Direct link

Baker, Eva L. – Educational Assessment, 2007

This article describes the history, evidence warrants, and evolution of the Center for Research on Evaluation, Standards, and Student Testing's (CRESST) model-based assessments. It considers alternative interpretations of scientific or practical models and illustrates how model-based assessment addresses both definitions. The components of the…

Descriptors: Educational Testing, Computer Assisted Testing, Validity, Test Construction

Occupational Education Research Project. A Model for Evaluation of Placement Testing in the North Carolina Community College System.

Tripp, John D.; Todd, Anne H. – 1982

A project was conducted to develop a model for evaluating placement testing in the North Carolina System of Community Colleges. Researchers at Central Piedmont Community College conducted a longitudinal study of students' progress through the college curriculum as it related to placement test scores. The following numbers of students comprised the…

Descriptors: Academic Achievement, Community Colleges, Educational Testing, Equivalency Tests

Untangling Testing

Bloomer, Corinne – Teacher, 1975

Article discussed the disadvantages of student testing as a means of evaluating student progress in the classroom and suggested the use of a new model of assessment. Three steps intended for classroom diagnosis of students were described. (RK)

Descriptors: Academic Achievement, Educational Testing, Models, Standardized Tests

Considering Alternatives to National Assessment Arrangements in England: Possibilities and Opportunities

Peer reviewed

Direct link

Green, Sylvia; Oates, Tim – Educational Research, 2009

Background: In this article we address some of the challenges posed by the development of national assessment systems and discuss the need for high quality information on trends in attainment; support for school improvement processes and ways in which learning should be enhanced through valid assessment. Purpose: Key elements are explored,…

Descriptors: Educational Objectives, National Standards, Educational Quality, Educational Change

A Framework for Analyzing the Inference Structure of Educational Achievement Tests.

Peer reviewed

Wardrop, James L.; And Others – Journal of Educational Measurement, 1982

A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…

Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models

Determination of Optimal Cutting Scores in Criterion-Referenced Measurement

Peer reviewed

Berk, Ronald A. – Journal of Experimental Education, 1976

Attempts to select empirically the optimal cutting score or criterion level for a test based on response data from validation samples of instructed and uninstructed students. This score maximizes the probability of correct mastery-nonmastery decisions (or minimizes the probability of incorrect decisions). (Author/RK)

Descriptors: Charts, Criterion Referenced Tests, Cutting Scores, Educational Testing

Evaluating Comparability in Computerized Adaptive Testing: Issues, Criteria and an Example.

Peer reviewed

Wang, Tianyou; Kolen, Michael J. – Journal of Educational Measurement, 2001

Reviews research literature on comparability issues in computerized adaptive testing (CAT) and synthesizes issues specific to comparability and test security. Develops a framework for evaluating comparability that contains three categories of criteria: (1) validity; (2) psychometric property/reliability; and (3) statistical assumption/test…

Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Criteria

Graphical Models and Computerized Adaptive Testing.

Download full text

Mislevy, Robert J.; Almond, Russell G. – 1997

This paper synthesizes ideas from the fields of graphical modeling and education testing, particularly item response theory (IRT) applied to computerized adaptive testing (CAT). Graphical modeling can offer IRT a language for describing multifaceted skills and knowledge, and disentangling evidence from complex performances. IRT-CAT can offer…

Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Testing, Higher Education

Construct Validity in Psychological Measurement; Proceedings of a Colloquium on Theory and Application in Education and Employment (Henry Chauncey Conference Center, Princeton, New Jersey, October 1979).

Office of Personnel Management, Washington, DC. – 1979

The stimulus for this colloquium was the convergence of several significant developments bearing on the construct validation of standardized tests and other assessment methods. Of these developments, some were fundamental to psychology as a science; others reflected socio-political pressures on measurement in education and employment. The ten…

Descriptors: Aptitude Tests, Educational Practices, Educational Testing, Employment Practices

Previous Page | Next Page »

Pages: 1 | 2

Haberman, Shelby J.	2
Sinharay, Sandip	2
Almond, Russell G.	1
Arter, Judith A.	1
Baker, Eva L.	1
Berk, Ronald A.	1
Bloomer, Corinne	1
Ernest, Patricia S.	1
Estes, Gary D.	1
Green, Sylvia	1
Hansen, Duncan N.	1
Horst, S. Jeanne	1
Jencks, Christopher	1
Kim, Jihye	1
Kolen, Michael J.	1
Leighton, Jacqueline P.	1
Leitzel, Thomas C.	1
McCowan, Richard J.	1
McCowan, Sheila C.	1
Mercer, Jane R.	1
Mislevy, Robert J.	1
Moreland, Kevin L.	1
Norris, Stephen P.	1
Oates, Tim	1
More ▼