NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 736 to 750 of 9,533 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Salim Nabhan; Anita Habók – SAGE Open, 2025
As the integration of digital technologies continues to shape academic landscapes, assessing digital literacy in the context of academic writing becomes paramount. Several instruments and frameworks are available for measuring digital literacy and examining it from different perspectives; however, none are suitable for measuring the digital…
Descriptors: Digital Literacy, Academic Language, Writing (Composition), Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Zhiqiang Yang; Chengyuan Yu – Asia Pacific Education Review, 2025
This study investigated the test fairness of the translation section of a large-scale English test in China by examining its Differential Test Functioning (DTF) and Differential Item Functioning (DIF) across gender and major. Regarding DTF, the entire translation section exhibits partial strong measurement invariance across female and male…
Descriptors: Multiple Choice Tests, Test Items, Scoring, Translation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gurdil Ege, Hatice; Demir, Ergul – Eurasian Journal of Educational Research, 2020
Purpose: The present study aims to evaluate how the reliabilities computed using a, Stratified a, Angoff-Feldt, and Feldt-Raju estimators may differ when sample size (500, 1000, and 2000) and item type ratio of dichotomous to polytomous items (2:1; 1:1, 1:2) included in the scale are varied. Research Methods: In this study, Cronbach's a,…
Descriptors: Test Format, Simulation, Test Reliability, Sample Size
Peer reviewed Peer reviewed
Direct linkDirect link
Holme, Thomas A.; Bauer, Christopher; Trate, Jaclyn M.; Reed, Jessica J.; Raker, Jeffrey R.; Murphy, Kristen L. – Journal of Chemical Education, 2020
The American Chemical Society, Division of Chemical Education, Examinations Institute has been developing content maps for the undergraduate program based on subdiscipline specifications since 2008. The Anchoring Concepts Content Maps (or ACCM) have been published in four subdisciplines (general, organic, physical, and inorganic chemistry) with…
Descriptors: Undergraduate Students, Chemistry, Scientific Concepts, Concept Mapping
Peer reviewed Peer reviewed
Direct linkDirect link
Cole, Brian S.; Lima-Walton, Elia; Brunnert, Kim; Vesey, Winona Burt; Raha, Kaushik – Journal of Applied Testing Technology, 2020
Automatic item generation can rapidly generate large volumes of exam items, but this creates challenges for assembly of exams which aim to include syntactically diverse items. First, we demonstrate a diminishing marginal syntactic return for automatic item generation using a saturation detection approach. This analysis can help users of automatic…
Descriptors: Artificial Intelligence, Automation, Test Construction, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Krach, Shelley Kathleen; McCreery, Michael P.; Dennis, Lindsay; Guerard, Jessika; Harris, Erica L. – Psychology in the Schools, 2020
Pearson now uses a technology-based testing platform, Q-Interactive, to administer tests previously available in paper versions. The same norms are used for both versions; Pearson's in-house equivalency studies indicated that both versions are equated. The goal of the current study is to independently evaluate equivalency findings. For the current…
Descriptors: Preschool Children, Computer Assisted Testing, Test Items, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ismail, Yilmaz – Educational Research and Reviews, 2020
This study draws on the understanding that when the correlation between variables is not known yet the non-linear expectation in the correlation between the variables is present, non-linear measurement tools can be used. In education, possibility measurement tools can be used for non-linear measurement. Multiple-choice possibility measurement…
Descriptors: Multiple Choice Tests, Measurement Techniques, Student Evaluation, Test Items
Derek Sauder – ProQuest LLC, 2020
The Rasch model is commonly used to calibrate multiple choice items. However, the sample sizes needed to estimate the Rasch model can be difficult to attain (e.g., consider a small testing company trying to pretest new items). With small sample sizes, auxiliary information besides the item responses may improve estimation of the item parameters.…
Descriptors: Item Response Theory, Sample Size, Computation, Test Length
Peer reviewed Peer reviewed
Direct linkDirect link
Chengran Wang; Bing Wei – Physical Review Physics Education Research, 2024
The notion of scientific visual literacy has been advocated in recent science curriculum reform documents and related learning outcomes are expected from students. However, few studies have been conducted to determine how it is tested in high-stakes examinations. This study utilized the Visualization Blooming Tool to examine the level of visual…
Descriptors: Physics, Scientific Literacy, Science Tests, Thinking Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Amber Dudley; Emma Marsden; Giulia Bovolenta – Language Testing, 2024
Vocabulary knowledge strongly predicts second language reading, listening, writing, and speaking. Yet, few tests have been developed to assess vocabulary knowledge in French. The primary aim of this pilot study was to design and initially validate the Context-Aligned Two Thousand Test (CA-TTT), following open research practices. The CA-TTT is a…
Descriptors: French, Vocabulary Development, Secondary School Students, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Yi-Chun Chen; Hsin-Kai Wu; Ching-Ting Hsin – Research in Science & Technological Education, 2024
Background and Purpose: As a growing number of instructional units have been developed to promote young children's scientific and engineering practices (SEPs), understanding how to evaluate and assess children's SEPs is imperative. However, paper-and-pencil assessments would not be suitable for young children because of their limited reading and…
Descriptors: Science Education, Engineering Education, Elementary School Students, Middle School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024
Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…
Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions
Peer reviewed Peer reviewed
Direct linkDirect link
Kokou A. Atitsogbe; Jean-Luc Bernaud – International Journal for Educational and Vocational Guidance, 2024
This manuscript aimed to develop an instrument assessing vocational values among students (VVS-S). The scale was developed in French using three different samples of Togolese participants for item development (N = 140), exploratory (N = 308) and confirmatory analyses (N = 300). It consists of 17 items divided into the five subscales of Power,…
Descriptors: Vocational Interests, Values, Measures (Individuals), Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Philomina Abena Anyidoho; Rebecca Berenbon; Bridget McHugh – International Journal of Training and Development, 2024
Many workforce development training programmes use learning gains as a measure of programme effectiveness. However, research on K-12 education suggests that posttest scores may be influenced by pretesting effects. Pretesting may improve posttest performance by giving learners preknowledge of posttest content. Alternatively, pretesting may enhance…
Descriptors: Trainees, Trainers, Labor Force Development, High Stakes Tests
Pages: 1  |  ...  |  46  |  47  |  48  |  49  |  50  |  51  |  52  |  53  |  54  |  ...  |  636