ERIC - Search Results

Showing 1 to 15 of 42 results

Comparison of Kernel Equating Methods under NEAT and NEC Designs

Peer reviewed

Ozsoy, Seyma Nur; Kilmen, Sevilay – International Journal of Assessment Tools in Education, 2023

In this study, Kernel test equating methods were compared under NEAT and NEC designs. In NEAT design, Kernel post-stratification and chain equating methods taking into account optimal and large bandwidths were compared. In the NEC design, gender and/or computer/tablet use was considered as a covariate, and Kernel test equating methods were…

Descriptors: Equated Scores, Testing, Test Items, Statistical Analysis

Modeling NAEP Test-Taking Behavior Using Educational Process Analysis

Peer reviewed

Patel, Nirmal; Sharma, Aditya; Shah, Tirth; Lomas, Derek – Journal of Educational Data Mining, 2021

Process Analysis is an emerging approach to discover meaningful knowledge from temporal educational data. The study presented in this paper shows how we used Process Analysis methods on the National Assessment of Educational Progress (NAEP) test data for modeling and predicting student test-taking behavior. Our process-oriented data exploration…

Descriptors: Learning Analytics, National Competency Tests, Evaluation Methods, Prediction

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Releasing Content to Deter Cheating: An Analysis of the Impact on Candidate Performance

Peer reviewed

Wolkowitz, Amanda A.; Davis-Becker, Susan L.; Gerrow, Jack D. – Journal of Applied Testing Technology, 2016

The purpose of this study was to investigate the impact of a cheating prevention strategy employed for a professional credentialing exam that involved releasing over 7,000 active and retired exam items. This study evaluated: 1) If any significant differences existed between examinee performance on released versus non-released items; 2) If item…

Descriptors: Cheating, Test Content, Test Items, Foreign Countries

Assessing the Impact of Characteristics of the Test, Common-Items, and Examinees on the Preservation of Equity Properties in Mixed-Format Test Equating

Wolf, Raffaela – ProQuest LLC, 2013

Preservation of equity properties was examined using four equating methods--IRT True Score, IRT Observed Score, Frequency Estimation, and Chained Equipercentile--in a mixed-format test under a common-item nonequivalent groups (CINEG) design. Equating of mixed-format tests under a CINEG design can be influenced by factors such as attributes of the…

Descriptors: Testing, Item Response Theory, Equated Scores, Test Items

Construction of Expert Knowledge Monitoring and Assessment System Based on Integral Method of Knowledge Evaluation

Peer reviewed

Golovachyova, Viktoriya N.; Menlibekova, Gulbakhyt Zh.; Abayeva, Nella F.; Ten, Tatyana L.; Kogaya, Galina D. – International Journal of Environmental and Science Education, 2016

Using computer-based monitoring systems that rely on tests could be the most effective way of knowledge evaluation. The problem of objective knowledge assessment by means of testing takes on a new dimension in the context of new paradigms in education. The analysis of the existing test methods enabled us to conclude that tests with selected…

Descriptors: Expertise, Computer Assisted Testing, Student Evaluation, Knowledge Level

ITC Guidelines for the Large-Scale Assessment of Linguistically and Culturally Diverse Populations

Peer reviewed

International Journal of Testing, 2019

These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…

Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage

Developing an Array Binary Code Assessment Rubric for Multiple-Choice Questions Using Item Arrays and Binary-Coded Responses

Peer reviewed

Haro, Elizabeth K.; Haro, Luis S. – Journal of Chemical Education, 2014

The multiple-choice question (MCQ) is the foundation of knowledge assessment in K-12, higher education, and standardized entrance exams (including the GRE, MCAT, and DAT). However, standard MCQ exams are limited with respect to the types of questions that can be asked when there are only five choices. MCQs offering additional choices more…

Descriptors: Multiple Choice Tests, Coding, Scoring Rubrics, Test Scoring Machines

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis

Twenty Common Testing Mistakes for EFL Teachers to Avoid

Henning, Grant – English Teaching Forum, 2012

To some extent, good testing procedure, like good language use, can be achieved through avoidance of errors. Almost any language-instruction program requires the preparation and administration of tests, and it is only to the extent that certain common testing mistakes have been avoided that such tests can be said to be worthwhile selection,…

Descriptors: Testing, English (Second Language), Testing Problems, Student Evaluation

A Multi-Expert Approach for Developing Testing and Diagnostic Systems Based on the Concept-Effect Model

Peer reviewed

Panjaburee, Patcharin; Hwang, Gwo-Jen; Triampo, Wannapong; Shih, Bo-Ying – Computers & Education, 2010

With the popularization of computer and communication technologies, researchers have attempted to develop computer-assisted testing and diagnostic systems to help students improve their learning performance on the Internet. In developing a diagnostic system for detecting students' learning problems, it is difficult for individual teachers to…

Descriptors: Learning Problems, Test Items, Testing, Teaching Methods

A Comparison of Computer-Based Testing and Pencil-and-Paper Testing for Students with a Read-Aloud Accommodation

Peer reviewed

Flowers, Claudia; Kim, Do-Hong; Lewis, Preston; Davis, Violeta Carmen – Journal of Special Education Technology, 2011

This study examined the academic performance and preference of students with disabilities for two types of test administration conditions, computer-based testing (CBT) and pencil-and-paper testing (PPT). Data from a large-scale assessment program were used to examine differences between CBT and PPT academic performance for third to eleventh grade…

Descriptors: Testing, Test Items, Effect Size, Computer Assisted Testing

Examining NAEP Achievement in Relation to School Testing Conditions in the 2010 Assessments

Mullis, Ina V. S.; Bohrnstedt, George W.; Preuschoff, Anna Corinna; de los Reyes, Illiana; Stancavage, Fran; Martin, Michael O. – American Institutes for Research, 2012

National Assessment of Educational Progress (NAEP) has expended considerable effort to ensure high quality in data collection by developing standardized materials and survey operation procedures and using well-trained professional administrators. However, schools are allowed to minimize the disruption associated with pulling students out of…

Descriptors: Testing, National Competency Tests, Program Effectiveness, Scores

Some Notes on the Reinvention of Latent Structure Models as Diagnostic Classification Models

Peer reviewed

von Davier, Matthias – Measurement: Interdisciplinary Research and Perspectives, 2009

In this commentary, the author points out a few issues, one being that there are models mislabeled as diagnostic, which deal with linear decompositions of item difficulties rather than estimating multidimensional skill variables. The author discusses the issue that there are many new names for essentially well-known models for multiple simultaneous…

Descriptors: Test Items, Probability, Models, Diagnostic Tests

Diagnostic Classification Modeling: Opportunity for Identity

Peer reviewed

Hancock, Gregory R. – Measurement: Interdisciplinary Research and Perspectives, 2009

As Rupp and Templin (2008) stated directly, diagnostic classification methods "are confirmatory in nature." Methods, though, are neither inherently confirmatory nor exploratory. Diagnostic classification modeling, with its analytical and computational obstacles eventually yielding as a comprehensive and potent discipline emerges, will…

Descriptors: Structural Equation Models, Test Items, Models, Diagnostic Tests
