Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 15 |
Since 2006 (last 20 years) | 73 |
Descriptor
Evaluation Methods | 89 |
Psychometrics | 89 |
Test Items | 89 |
Test Construction | 34 |
Item Response Theory | 32 |
Models | 23 |
Student Evaluation | 23 |
Test Validity | 21 |
Educational Assessment | 20 |
Foreign Countries | 16 |
Measurement Techniques | 16 |
More ▼ |
Source
Author
Abedi, Jamal | 2 |
Bowles, Ryan P. | 2 |
Frey, Andreas | 2 |
Goodwin, Sarah | 2 |
Hartig, Johannes | 2 |
Holling, Heinz | 2 |
Konishi, Haruka | 2 |
Robitzsch, Alexander | 2 |
Skibbe, Lori E. | 2 |
Troia, Gary A. | 2 |
Akarsu, Bayram | 1 |
More ▼ |
Publication Type
Education Level
Location
Canada | 3 |
Australia | 2 |
Germany | 2 |
Alabama | 1 |
California | 1 |
China | 1 |
Dominica | 1 |
Florida | 1 |
Grenada | 1 |
Italy | 1 |
Massachusetts | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Deschênes, Marie-France; Dionne, Éric; Dorion, Michelle; Grondin, Julie – Practical Assessment, Research & Evaluation, 2023
The use of the aggregate scoring method for scoring concordance tests requires the weighting of test items to be derived from the performance of a group of experts who take the test under the same conditions as the examinees. However, the average score of experts constituting the reference panel remains a critical issue in the use of these tests.…
Descriptors: Scoring, Tests, Evaluation Methods, Test Items
Fu Chen; Ying Cui; Alina Lutsyk-King; Yizhu Gao; Xiaoxiao Liu; Maria Cutumisu; Jacqueline P. Leighton – Education and Information Technologies, 2024
Post-secondary data literacy education is critical to students' academic and career success. However, the literature has not adequately addressed the conceptualization and assessment of data literacy for post-secondary students. In this study, we introduced a novel digital performance-based assessment for teaching and evaluating post-secondary…
Descriptors: Performance Based Assessment, College Students, Information Literacy, Evaluation Methods
Matthew John Davidson – ProQuest LLC, 2022
Digitally-based assessments create opportunities for collecting moment to moment information about how students are responding to assessment items. This information, called log or process data, has long been regarded as a vast and valuable source of data about student performance. Despite repeated assurances of its vastness and value, process data…
Descriptors: Data Use, Psychometrics, Item Response Theory, Test Items
Kate E. Williams; Magdalena Janus; Linda J. Harrison; Sandie Wong; Sheena Elwick; Laura McFarland – Australasian Journal of Early Childhood, 2024
Child observation is a critical component of quality pedagogy in early childhood education and care (ECEC). The ORICL (Observe, Reflect, Improve Children's Learning) tool was co-designed by ECEC researchers, policymakers, leaders, and practitioners to support this work. Educators rate the experiences of individual children, and responses of…
Descriptors: Capacity Building, Infants, Toddlers, Early Childhood Education
Meng, Yaru; Fu, Hua – Modern Language Journal, 2023
The distinguishing feature of dynamic assessment (DA) is the dialectical integration of assessment and instruction. However, how to design the targeted instruction or mediation has been relatively underexplored. To address this gap, this study proposes the attribute-based mediation model (AMM), an English-as-a-foreign-language listening mediation…
Descriptors: Evaluation Methods, Teaching Methods, Models, English (Second Language)
Furter, Robert T.; Dwyer, Andrew C. – Applied Measurement in Education, 2020
Maintaining equivalent performance standards across forms is a psychometric challenge exacerbated by small samples. In this study, the accuracy of two equating methods (Rasch anchored calibration and nominal weights mean) and four anchor item selection methods were investigated in the context of very small samples (N = 10). Overall, nominal…
Descriptors: Classification, Accuracy, Item Response Theory, Equated Scores
Thapelo Ncube Whitfield – ProQuest LLC, 2021
Student Experience surveys are used to measure student attitudes towards their campus as well as to initiate conversations for institutional change. Validity evidence to support the interpretations of these surveys' results, however, is lacking. The first purpose of this study was to compare three Differential Item Functioning (DIF) methods on…
Descriptors: College Students, Student Surveys, Student Experience, Student Attitudes
Skibbe, Lori E.; Bowles, Ryan P.; Goodwin, Sarah; Troia, Gary A.; Konishi, Haruka – Language, Speech, and Hearing Services in Schools, 2020
Purpose: The Access to Literacy Assessment System--Phonological Awareness (ATLAS-PA) was developed for use with children with speech and/or language impairment. The subtests (Rhyming, Blending, and Segmenting) are appropriate for children who are 3-7 years of age. ATLAS-PA is composed entirely of receptive items, incorporates individualized levels…
Descriptors: Phonological Awareness, Speech Impairments, Language Impairments, Young Children
Skibbe, Lori E.; Bowles, Ryan P.; Goodwin, Sarah; Troia, Gary A.; Konishi, Haruka – Grantee Submission, 2020
Purpose: The Access to Literacy Assessment System--Phonological Awareness (ATLAS-PA) was developed for use with children with speech and/or language impairment. The subtests (rhyming, blending, segmenting) are appropriate for children who are 3 to 7 years of age. ATLAS-PA is comprised entirely of receptive items, incorporates individualized levels…
Descriptors: Phonological Awareness, Speech Impairments, Language Impairments, Young Children
Michelle M. Neumann; Jason L. Anthony; Noé A. Erazo; David L. Neumann – Grantee Submission, 2019
The framework and tools used for classroom assessment can have significant impacts on teacher practices and student achievement. Getting assessment right is an important component in creating positive learning experiences and academic success. Recent government reports (e.g., United States, Australia) call for the development of systems that use…
Descriptors: Early Childhood Education, Futures (of Society), Educational Assessment, Evaluation Methods
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing
Hardré, Patricia L.; Hackett, Shannon – Educational Assessment, Evaluation and Accountability, 2015
This manuscript chronicles the process and products of a redesign for evaluation of the graduate college experience (GCE) which was initiated by a university graduate college, based on its observed need to reconsider and update its measures and methods for assessing graduate students' experiences. We examined the existing instrumentation and…
Descriptors: Test Construction, Graduate Students, Student Experience, Evaluation Methods
Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014
Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…
Descriptors: Test Bias, Models, Simulation, Error Patterns
Thummaphan, Phonraphee – ProQuest LLC, 2017
The present study aimed to represent the innovative assessments that support students' learning in STEM education through using the integrative framework for Cognitive Diagnostic Modeling (CDM). This framework is based on three components, cognition, observation, and interpretation (National Research Council, 2001). Specifically, this dissertation…
Descriptors: STEM Education, Cognitive Processes, Observation, Psychometrics
Kaspar, Roman; Döring, Ottmar; Wittmann, Eveline; Hartig, Johannes; Weyland, Ulrike; Nauerth, Annette; Möllers, Michaela; Rechenbach, Simone; Simon, Julia; Worofka, Iberé – Vocations and Learning, 2016
Valid and reliable standardized assessment of nursing competencies is needed to monitor the quality of vocational education and training (VET) in nursing and evaluate learning outcomes for care work trainees with increasingly heterogeneous learning backgrounds. To date, however, the modeling of professional competencies has not yet evolved into…
Descriptors: Nursing Education, Geriatrics, Video Technology, Computer Assisted Testing