ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	56

Descriptor

Educational Testing	184
Measurement Techniques	184
Evaluation Methods	57
Educational Assessment	52
Test Construction	49
Student Evaluation	40
Psychometrics	30
Elementary Secondary Education	29
Academic Achievement	28
Measurement	27
Models	27
Test Use	26
Testing Problems	26
Achievement Tests	25
Comparative Analysis	22
Test Interpretation	22
Standardized Tests	21
Statistical Analysis	20
Testing	20
Higher Education	19
Item Response Theory	19
Test Items	19
Test Reliability	18
Test Validity	18
Computer Assisted Testing	17
More ▼

Education Level

Elementary Secondary Education	28
Higher Education	10
Postsecondary Education	6
Elementary Education	4
Grade 8	2
Secondary Education	2
Adult Education	1
Early Childhood Education	1
Grade 1	1
Grade 2	1
Grade 3	1
Grade 4	1
Kindergarten	1
More ▼

Audience

Practitioners	7
Researchers	6
Teachers	4
Administrators	2
Students	2
Policymakers	1

Location

United Kingdom	9
United States	6
United Kingdom (England)	4
New York	3
California	2
New Jersey	2
United Kingdom (Wales)	2
Australia	1
Canada	1
Colombia	1
Florida	1
Illinois	1
Indiana	1
Kansas	1
Minnesota	1
Minnesota (Minneapolis)	1
Nebraska	1
North Carolina	1
Oklahoma	1
Switzerland (Geneva)	1
Taiwan	1
Tennessee	1
Texas	1
Thailand	1
Utah	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Individuals with Disabilities…	2
Elementary and Secondary…	1

What Works Clearinghouse Rating

Meets WWC Standards with or without Reservations

Showing 1 to 15 of 184 results Save | Export

Inaccurate Individual Ability Estimates with Three-Parameter Item Response Models in Mixture Settings

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A.; Huber, Chuck – Measurement: Interdisciplinary Research and Perspectives, 2020

It is demonstrated that the popular three-parameter logistic model can lead to markedly inaccurate individual ability level estimates for mixture populations. A theoretically and empirically important setting is initially considered where (a) in one of two subpopulations (latent classes) the two-parameter logistic model holds for each item in a…

Descriptors: Item Response Theory, Models, Measurement Techniques, Item Analysis

Fairness in Measurement and Selection: Statistical, Philosophical, and Public Perspectives

Peer reviewed

Direct link

Zwick, Rebecca – Educational Measurement: Issues and Practice, 2019

Selection decisions have a major impact on our education, occupation, and quality of life, and the role of standardized tests in selection has always been a source of controversy. Here, I consider various definitions of fairness in measurement and selection--those emerging from within educational measurement and statistics, those from philosophy,…

Descriptors: Culture Fair Tests, Decision Making, Standardized Tests, Selection Criteria

Examination of the Parameter Estimate Bias When Violating the Orthogonality Assumption of the Bifactor Model

Direct link

Zheng, Chunmei – ProQuest LLC, 2013

Educational and psychological constructs are normally measured by multifaceted dimensions. The measured construct is defined and measured by a set of related subdomains. A bifactor model can accurately describe such data with both the measured construct and the related subdomains. However, a limitation of the bifactor model is the orthogonality…

Descriptors: Educational Testing, Measurement Techniques, Test Items, Models

A Multicomponent Latent Trait Model for Diagnosis

Peer reviewed

Direct link

Embretson, Susan E.; Yang, Xiangdong – Psychometrika, 2013

This paper presents a noncompensatory latent trait model, the multicomponent latent trait model for diagnosis (MLTM-D), for cognitive diagnosis. In MLTM-D, a hierarchical relationship between components and attributes is specified to be applicable to permit diagnosis at two levels. MLTM-D is a generalization of the multicomponent latent trait…

Descriptors: Mathematics Achievement, Achievement Tests, Item Response Theory, Measurement

Development of a Hybrid Method for Dimensionality Identification Incorporating an Angle-Based Approach

Direct link

Zeng, Ji – ProQuest LLC, 2010

Correct dimensionality identification (i.e., a correct decision on the number of factors to retain) is crucial not only in educational and psychological measurement, but also in various fields such as medicine and sociology that use exploratory factor analysis (EFA) in developing theories. However, to date, no single method has been endorsed for…

Descriptors: Measurement Techniques, Identification, Factor Analysis, Psychology

A Comparison of Equating/Linking Using the Stocking-Lord Method and Concurrent Calibration with Mixed-Format Tests in the Non-Equivalent Groups Common-Item Design under IRT

Direct link

Tian, Feng – ProQuest LLC, 2011

There has been a steady increase in the use of mixed-format tests, that is, tests consisting of both multiple-choice items and constructed-response items in both classroom and large-scale assessments. This calls for appropriate equating methods for such tests. As Item Response Theory (IRT) has rapidly become mainstream as the theoretical basis for…

Descriptors: Item Response Theory, Comparative Analysis, Equated Scores, Statistical Analysis

A Case of Inconsistent Equatings: How the Man with Four Watches Decides What Time It Is

Peer reviewed

Direct link

Livingston, Samuel A.; Antal, Judit – Applied Measurement in Education, 2010

A simultaneous equating of four new test forms to each other and to one previous form was accomplished through a complex design incorporating seven separate equating links. Each new form was linked to the reference form by four different paths, and each path produced a different score conversion. The procedure used to resolve these inconsistencies…

Descriptors: Measurement Techniques, Measurement, Educational Assessment, Educational Testing

Aberrant Response Patterns as a Multidimensional Phenomenon: Using Factor-Analytic Model Comparison to Detect Cheating

Direct link

Clark, John Michael, III. – ProQuest LLC, 2010

This dissertation proposes a new factor-analytic technique for detecting cheating on exams. Person-fit statistics have been developed to assess the extent to which examinees' response patterns are consistent with expectation, with expectation defined in the context of some model. Response patterns that are inconsistent with expectation are said to…

Descriptors: Evidence, Expectation, Item Response Theory, Factor Analysis

A Comparison of Computer-Based Classification Testing Approaches Using Mixed-Format Tests with the Generalized Partial Credit Model

Direct link

Kim, Jiseon – ProQuest LLC, 2010

Classification testing has been widely used to make categorical decisions by determining whether an examinee has a certain degree of ability required by established standards. As computer technologies have developed, classification testing has become more computerized. Several approaches have been proposed and investigated in the context of…

Descriptors: Test Length, Computer Assisted Testing, Classification, Probability

The Long-Term Impacts of Teachers: Teacher Value-Added and Student Outcomes in Adulthood. NBER Working Paper No. 17699

Direct link

Raj Chetty; John N. Friedman; Jonah E. Rockoff – National Bureau of Economic Research, 2011

Are teachers' impacts on students' test scores ("value-added") a good measure of their quality? This question has sparked debate largely because of disagreement about (1) whether value-added (VA) provides unbiased estimates of teachers' impacts on student achievement and (2) whether high-VA teachers improve students' long-term outcomes.…

Descriptors: Academic Achievement, Scores, Teacher Effectiveness, Outcomes of Education

Comparability of Paper-and-Pencil and Computer-Based Cognitive and Non-Cognitive Measures in a Low-Stakes Testing Environment

Direct link

Rowan, Barbara E. – ProQuest LLC, 2010

Computerized versions of paper-and-pencil tests (PPT) have emerged over the past few decades, and some practitioners are using both formats concurrently. But computerizing a PPT may not yield equivalent scores across the two administration modes. Comparability studies are required to determine if the scores are equivalent before treating them as…

Descriptors: Computer Assisted Testing, Factor Structure, Program Effectiveness, Scores

A Comparison of IRT Linking Procedures

Peer reviewed

Direct link

Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010

Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…

Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques

Using and Developing Measurement Instruments in Science Education: A Rasch Modeling Approach. Science & Engineering Education Sources

Direct link

Liu, Xiufeng – IAP - Information Age Publishing, Inc., 2010

This book meets a demand in the science education community for a comprehensive and introductory measurement book in science education. It describes measurement instruments reported in refereed science education research journals, and introduces the Rasch modeling approach to developing measurement instruments in common science assessment domains,…

Descriptors: Graduate Students, Textbooks, Research Methodology, Science Tests

Group Comparisons of Mathematics Performance from a Cognitive Diagnostic Perspective

Peer reviewed

Direct link

Chen, Yi-Hsin; Ferron, John M.; Thompson, Marilyn S.; Gorin, Joanna S.; Tatsuoka, Kikumi K. – Educational Research and Evaluation, 2010

Traditional comparisons of test score means identify group differences in broad academic areas, but fail to provide substantive description of how the groups differ on the specific cognitive attributes required for success in the academic area. The rule space method (RSM) allows for group comparisons at the cognitive attribute level, which…

Descriptors: Foreign Countries, Academic Achievement, Probability, Algebra

Graded Response Model Based on the Logistic Positive Exponent Family of Models for Dichotomous Responses

Peer reviewed

Direct link

Samejima, Fumiko – Psychometrika, 2008

Samejima ("Psychometrika "65:319--335, 2000) proposed the logistic positive exponent family of models (LPEF) for dichotomous responses in the unidimensional latent space. The objective of the present paper is to propose and discuss a graded response model that is expanded from the LPEF, in the context of item response theory (IRT). This…

Descriptors: Psychological Testing, Item Response Theory, Psychometrics, Educational Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13

Measurement:…	13
ProQuest LLC	10
Educational Measurement:…	7
Popular Measurement	4
Applied Measurement in…	3
Applied Psychological…	3
Educational Research and…	3
Educational and Psychological…	3
Evaluation in Education:…	3
Journal of Educational…	3
National Center for Analysis…	3
Psychometrika	3
Educational Testing Service	2
New Directions for…	2
Research Papers in Education	2
Review of Research in…	2
School Science Review	2
American Journal of Education	1
American Psychologist	1
American Vocational Journal	1
Art Education	1
Bureau of Education,…	1
Business Education World	1
College Board Review	1
College Student Journal	1
More ▼

Wright, Benjamin D.	3
van der Linden, Wim J.	3
Brennan, Robert L.	2
Embretson, Susan E.	2
Harris, Douglas N.	2
Newton, Paul E.	2
Samejima, Fumiko	2
Volkwein, J. Fredericks	2
Yin, Alexander C.	2
AMRAM, FRED M.	1
Abedi, Jamal	1
Achterberg, James E.	1
Airaisian, Peter W.	1
Airasian, Peter W.	1
Albus, Debra	1
Allal, Linda K.	1
Almond, Russell G.	1
Angoff, William H.	1
Antal, Judit	1
Arismendi-Pardi, E. J.	1
Bagnato, Stephen J.	1
Baird, Jo-Anne	1
Baird, Leonard L., Ed.	1
More ▼

Journal Articles	76
Reports - Evaluative	32
Reports - Research	32
Speeches/Meeting Papers	25
Opinion Papers	23
Reports - Descriptive	21
Dissertations/Theses -…	10
Information Analyses	9
Books	8
Collected Works - General	5
Guides - Non-Classroom	4
Historical Materials	4
Book/Product Reviews	3
Guides - General	3
Non-Print Media	2
Collected Works - Proceedings	1
Collected Works - Serials	1
Dissertations/Theses -…	1
ERIC Publications	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

SAT (College Admission Test)	7
Advanced Placement…	3
National Assessment of…	3
ACT Assessment	2
College Level Examination…	2
Test of English as a Foreign…	2
Bender Gestalt Test	1
Collegiate Assessment of…	1
Defining Issues Test	1
Differential Aptitude Test	1
Gates MacGinitie Reading Tests	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Holland Vocational Preference…	1
Metropolitan Readiness Tests	1
Minnesota Multiphasic…	1
National Assessment of Adult…	1
Nelson Denny Reading Tests	1
Sequential Tests of…	1
Stanford Achievement Tests	1
Strong Campbell Interest…	1
Wechsler Intelligence Scale…	1
More ▼