Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model
Custer, Michael; Kim, Jongpil – Online Submission, 2023
This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…
Descriptors: Sample Size, Item Response Theory, Test Items, Computation
Al-Jarf, Reima – Online Submission, 2023
This article aims to give a comprehensive guide to planning and designing vocabulary tests, which includes identifying the skills to be covered by the test; outlining the course content covered; preparing a table of specifications that shows the skill, content topics, and number of questions allocated to each; and preparing the test instructions. The…
Descriptors: Vocabulary Development, Learning Processes, Test Construction, Course Content
Carvajal-Espinoza, Jorge; Welch, Greg W. – Online Submission, 2016
When tests are translated into one or more languages, the question of the equivalence of items across language forms arises. This equivalence can be assessed at the scale level by means of a multiple group confirmatory factor analysis (CFA) in the context of structural equation modeling. This study examined the measurement equivalence of a Spanish…
Descriptors: Translation, Spanish, English, Mathematics Tests
Edward Paul Getman – Online Submission, 2020
Despite calls for engaging assessments targeting young language learners (YLLs) between 8 and 13 years old, what makes assessment tasks engaging and how such task characteristics affect measurement quality have not been well studied empirically. Furthermore, there has been a dearth of validity research about technology-enhanced speaking tests for…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Learner Engagement
Korkmaz, Ö.; Korkmaz, M. K. – Online Submission, 2016
The aim of this study is to improve a measurement tool to evaluate the self-efficacy of Electrical-Electronics Engineering students through their basic electronics skills. The sample group is composed of 124 Electrical-Electronics Engineering students. The validity of the scale is analyzed with two different methods through factor analysis and…
Descriptors: Electronics, Skill Development, Self Efficacy, Test Reliability
Finster, Matthew – Online Submission, 2017
This brief presents initial evidence about the reliability and validity of a novice teacher survey and a novice teacher supervisor survey. The novice teacher and novice teacher supervisor surveys assess how well prepared novice teachers are to meet the job requirements of teaching. The surveys are designed to provide educator preparation programs…
Descriptors: Test Construction, Test Validity, Teacher Surveys, Beginning Teachers
He, Wei; Li, Feifei; Wolfe, Edward W.; Mao, Xia – Online Submission, 2012
For tests composed solely of testlets, the local item independence assumption tends to be violated. Using empirical data from a large-scale state assessment program, this study investigates the effects of using different models on equating results under the non-equivalent group anchor-test (NEAT) design. Specifically, the…
Descriptors: Test Items, Equated Scores, Models, Item Response Theory
Lorié, William A. – Online Submission, 2013
A reverse engineering approach to automatic item generation (AIG) was applied to a figure-based publicly released test item from the Organisation for Economic Cooperation and Development (OECD) Programme for International Student Assessment (PISA) mathematical literacy cognitive instrument as part of a proof of concept. The author created an item…
Descriptors: Numeracy, Mathematical Concepts, Mathematical Logic, Difficulty Level
Gomiero, Tiziano; Croce, Luigi; Grossi, Enzo; Luc, De Vreese; Buscema, Massimo; Mantesso, Ulrico; De Bastiani, Elisa – Online Submission, 2011
The aim of this paper is to present a shortened version of the SIS (support intensity scale) obtained by the application of mathematical models and instruments, adopting special algorithms based on the most recent developments in artificial adaptive systems. All the variables of SIS applied to 1,052 subjects with ID (intellectual disabilities)…
Descriptors: Foreign Countries, Mathematical Models, Mental Retardation, Measures (Individuals)
Oosterhof, Albert; Rohani, Faranak; Sanfilippo, Carol; Stillwell, Peggy; Hawkins, Karen – Online Submission, 2008
In assessment, the ability to construct test items that measure a targeted skill is fundamental to validity and alignment. The ability to do the reverse is also important: determining what skill an existing test item measures. This paper presents a model for classifying test items that builds on procedures developed by others, including Bloom…
Descriptors: Test Items, Classification, Models, Cognitive Ability
Tristan, Agustin; Vidal, Rafael – Online Submission, 2007
Wright and Stone proposed three features to assess the quality of the distribution of item difficulties in a test on the so-called "most probable response map": line, stack, and gap. Once a line is accepted as a design model for a test, gaps and stacks are practically eliminated, producing evidence of the "scale…
Descriptors: Test Validity, Models, Difficulty Level, Test Items
Chen, Yi-Hsin; Gorin, Joanna; Thompson, Marilyn; Tatsuoka, Kikumi – Online Submission, 2006
Educational assessment is a process of collecting evidence and interpreting it to provide instructors with information regarding students' learning. However, the current design and scoring of most standardized educational tests are insufficient to serve this purpose. The limitation exists primarily due to the lack of cognitive information…
Descriptors: Foreign Countries, Grade 8, Psychometrics, Probability
Rizavi, Saba; Way, Walter D.; Lu, Ying; Pitoniak, Mary; Steffen, Manfred – Online Submission, 2004
The purpose of this study was to use realistically simulated data to evaluate various CAT designs for use with the verbal reasoning measure of the Medical College Admission Test (MCAT). Factors such as item pool depth, content constraints, and item formats often cause repeated adaptive administrations of an item at ability levels that are not…
Descriptors: Test Items, Test Bias, Item Banks, College Admission