Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 44 |
Descriptor
Evaluation Methods | 75 |
Measurement Techniques | 75 |
Test Items | 75 |
Item Response Theory | 23 |
Test Construction | 22 |
Models | 17 |
Psychometrics | 16 |
Simulation | 13 |
Student Evaluation | 12 |
Research Methodology | 11 |
Test Validity | 11 |
Author
Blunk, Merrie | 2 |
Hill, Heather C. | 2 |
Penfield, Randall D. | 2 |
Steffen, Manfred | 2 |
Wang, Wen-Chung | 2 |
van der Linden, Wim J. | 2 |
Ackerman, Terry A. | 1 |
Adom, Dickson | 1 |
Aggen, Steven H. | 1 |
Avsec, Stanislav | 1 |
Azevedo, Jose | 1 |
Education Level
Elementary Secondary Education | 7 |
Higher Education | 7 |
Elementary Education | 4 |
Grade 4 | 4 |
Grade 8 | 4 |
Secondary Education | 3 |
Grade 6 | 2 |
High Schools | 2 |
Middle Schools | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Audience
Practitioners | 4 |
Teachers | 4 |
Researchers | 2 |
Location
Arizona | 1 |
California | 1 |
Ghana | 1 |
Japan | 1 |
Massachusetts | 1 |
Portugal | 1 |
South Africa | 1 |
South Korea | 1 |
Turkey | 1 |
United Kingdom (Great Britain) | 1 |
United States | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
ACT Assessment | 1 |
California Achievement Tests | 1 |
Medical College Admission Test | 1 |
National Assessment of… | 1 |
North Carolina End of Course… | 1 |
Program for International… | 1 |
Stanford Achievement Tests | 1 |
Adom, Dickson; Mensah, Jephtar Adu; Dake, Dennis Atsu – International Journal of Evaluation and Research in Education, 2020
Test, measurement, and evaluation are concepts used in education to explain how the progress of learning and the final learning outcomes of students are assessed. However, the terms are often misused in the field of education, especially in Ghana. The objective of the study was to thoroughly explain the concepts to assist educationists and…
Descriptors: Foreign Countries, Educational Research, Evaluation Methods, Measurement Techniques
Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021
This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders, held in the first semester of 2018 and one of the common exams carried out by the Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…
Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items
Setiawan, Risky – European Journal of Educational Research, 2019
The purposes of this research are: 1) to compare two test equating procedures, the Haebara and Stocking-Lord methods; 2) to describe the characteristics of each equating method using the Windows program IRTEQ. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination…
Descriptors: Equated Scores, Evaluation Methods, Evaluation Criteria, Test Items
Kaplan, David; Su, Dan – Large-scale Assessments in Education, 2018
Background: This paper extends a recent study by Kaplan and Su ("J Educ Behav Stat" 41: 51-80, 2016) examining the problem of matrix sampling of context questionnaire scales with respect to the generation of plausible values of cognitive outcomes in large-scale assessments. Methods: Following Weirich et al. ("Nested multiple…
Descriptors: Questionnaires, Measurement, Measurement Techniques, Evaluation Methods
Avsec, Stanislav; Jamšek, Janez – International Journal of Technology and Design Education, 2016
Technological literacy is identified as a vital achievement of technology- and engineering-intensive education. It guides the design of technology and technical components of educational systems and defines competitive employment in technological society. Existing methods for measuring technological literacy are incomplete or complicated,…
Descriptors: Technological Literacy, Elementary School Students, Secondary School Students, Evaluation Methods
Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014
Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…
Descriptors: Test Bias, Models, Simulation, Error Patterns
He, Yong – ProQuest LLC, 2013
Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…
Descriptors: Test Items, Regression (Statistics), Simulation, Comparative Analysis
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Kim, Do-Hong; Lambert, Richard G.; Burts, Diane C. – Early Education and Development, 2013
Research Findings: This study examined the measurement equivalence of the "Teaching Strategies GOLD®" assessment system across subgroups of children based on their primary language and disability status. This study is based on teacher-collected assessment data for 3-, 4-, and 5-year-old children for the fall of 2010, winter of 2010, and spring…
Descriptors: English Language Learners, Teaching Methods, Educational Strategies, Special Needs Students
Torres, Cristina; Lopes, Ana Paula; Babo, Lurdes; Azevedo, Jose – Online Submission, 2011
An MC (multiple-choice) question can be defined as a question in which students are asked to select one alternative from a given set of alternatives in response to a question stem. The objective of this paper is to analyse whether MC questions may be considered an interesting alternative for assessing knowledge, particularly in the mathematics area,…
Descriptors: Multiple Choice Tests, Alternative Assessment, Evaluation Methods, Questioning Techniques
Swail, Watson Scott – College and University, 2011
College rankings create much talk and discussion in the higher education arena. This love/hate relationship has not necessarily resulted in better rankings, but rather, more rankings. This paper looks at some of the measures and pitfalls of the current rankings systems, and proposes areas for improvement through a better focus on teaching and…
Descriptors: Higher Education, Measurement Objectives, Measurement Techniques, Classification
Lee, Won-Chan; Ban, Jae-Chun – Applied Measurement in Education, 2010
Various applications of item response theory often require linking to achieve a common scale for item parameter estimates obtained from different groups. This article used a simulation to examine the relative performance of four different item response theory (IRT) linking procedures in a random groups equating design: concurrent calibration with…
Descriptors: Item Response Theory, Simulation, Comparative Analysis, Measurement Techniques
Potgieter, Marietjie; Malatje, Esther; Gaigher, Estelle; Venter, Elsie – International Journal of Science Education, 2010
This study investigated the use of performance-confidence relationships to signal the presence of alternative conceptions and inadequate problem-solving skills in mechanics. A group of 33 students entering physics at a South African university participated in the project. The test instrument consisted of 20 items derived from existing standardised…
Descriptors: Foreign Countries, Instructional Design, Test Items, Mechanics (Physics)
Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien – Educational and Psychological Measurement, 2009
This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…
Descriptors: Test Items, Measures (Individuals), Test Bias, Evaluation Research
Klein Entink, R. H.; Fox, J. P.; van der Linden, W. J. – Psychometrika, 2009
Response times on test items are easily collected in modern computerized testing. When collecting both (binary) responses and (continuous) response times on test items, it is possible to measure the accuracy and speed of test takers. To study the relationships between these two constructs, the model is extended with a multivariate multilevel…
Descriptors: Test Items, Markov Processes, Item Response Theory, Measurement Techniques