Kate E. Walton; Cristina Anguiano-Carrasco – ACT, Inc., 2024
Large language models (LLMs), such as ChatGPT, are becoming increasingly prominent. Their use is becoming more and more popular to assist with simple tasks, such as summarizing documents, translating languages, rephrasing sentences, or answering questions. Reports like McKinsey's (Chui & Yee, 2023) estimate that by implementing LLMs,…
Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction
Maïano, Christophe; Morin, Alexandre J. S.; Tietjens, Maike; Bastos, Tânia; Luiggi, Maxime; Corredeira, Rui; Griffet, Jean; Sánchez-Oliva, David – Measurement in Physical Education and Exercise Science, 2023
The present study sought to examine the psychometric properties of new German, Portuguese, and Spanish versions of the Revised Short Form of the Physical Self-Inventory (PSI-S-R), and to contrast these properties against those from the original French version of this instrument. Participants (n = 1802) were 288 French youth, 177 German…
Descriptors: German, Portuguese, Spanish, Test Construction
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Ozdemir, Burhanettin; Gelbal, Selahattin – Education and Information Technologies, 2022
The computerized adaptive tests (CAT) apply an adaptive process in which the items are tailored to individuals' ability scores. The multidimensional CAT (MCAT) designs differ in terms of different item selection, ability estimation, and termination methods being used. This study aims at investigating the performance of the MCAT designs used to…
Descriptors: Scores, Computer Assisted Testing, Test Items, Language Proficiency
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019
This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…
Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring
Bao, Lei; Koenig, Kathleen; Xiao, Yang; Fritchman, Joseph; Zhou, Shaona; Chen, Cheng – Physical Review Physics Education Research, 2022
Abilities in scientific thinking and reasoning have been emphasized as core areas of initiatives, such as the Next Generation Science Standards or the College Board Standards for College Success in Science, which focus on the skills the future will demand of today's students. Although there is rich literature on studies of how these abilities…
Descriptors: Physics, Science Instruction, Teaching Methods, Thinking Skills
Bakhtiar, Mehdi; Wong, Min Ney; Tsui, Emily Ka Yin; McNeil, Malcolm R. – Journal of Speech, Language, and Hearing Research, 2020
Purpose: This study reports the psychometric development of the Cantonese versions of the English Computerized Revised Token Test (CRTT) for persons with aphasia (PWAs) and healthy controls (HCs). Method: The English CRTT was translated into standard Chinese for the Reading-Word Fade version (CRTT-R-WF-Cantonese) and into formal…
Descriptors: Psychometrics, Sino Tibetan Languages, Computer Assisted Testing, Aphasia
Shah, Ashima Mathur; Wylie, Caroline; Gitomer, Drew; Noam, Gil – Science Education, 2018
In and out-of-school time (OST) experiences are viewed as complementary in contributing to students' interest, engagement, and performance in science, technology, engineering, and mathematics (STEM). While tools exist to measure quality in general afterschool settings and others to measure structured science classroom experiences, there is a need…
Descriptors: STEM Education, Educational Improvement, Educational Quality, After School Programs
Fauville, Géraldine; Strang, Craig; Cannady, Matthew A.; Chen, Ying-Fang – Environmental Education Research, 2019
The Ocean Literacy movement began in the U.S. in the early 2000s, and has recently become an international effort. The focus on marine environmental issues and marine education is increasing, and yet it has been difficult to show progress of the ocean literacy movement, in part, because no widely adopted measurement tool exists. The International…
Descriptors: Marine Education, Environmental Education, Comparative Analysis, Factor Structure
McClellan, Catherine; Snyder, Rebecca; Woods-Murphy, Maryann; Basset, Katherine – National Network of State Teachers of the Year, 2018
Great teachers recognize great assessments. As policy and education leaders work to make sure state tests are measuring the problem-solving, writing, and critical-thinking skills students need for success, they should convene and rely on teachers to review test quality and help answer the question: Do the questions on our state test reflect…
Descriptors: Student Evaluation, Educational Quality, Standardized Tests, Test Items
Deha Dogan, C.; Canan Karababa, Z.; Fulya Soguksu, A. – Educational Studies, 2017
The purpose of this study is to develop a valid and reliable scale to assess the level of English usage in daily life by students between 15 and 19 years of age, and to compare these students' scale scores according to their achievement levels in an English course. Five hundred and ninety-five participants were randomly selected from a universe.…
Descriptors: Language Usage, English (Second Language), Test Construction, Adolescents
Moore, E. Whitney G.; Brown, Theresa C.; Fry, Mary D. – Measurement in Physical Education and Exercise Science, 2015
The purpose of this study was to develop an abbreviated version of the Perceived Motivational Climate in Exercise Questionnaire (PMCEQ-A) to provide a more practical instrument for use in applied exercise settings. In the calibration step, two shortened versions' measurement and latent model values were compared to each other and the original…
Descriptors: Questionnaires, Psychometrics, Motivation, Exercise
Al-Tal, Suhair; AL-Jawaldeh, Fuad; AL-Taj, Heyam; Maharmeh, Lina – International Education Studies, 2017
This study aimed at revealing the emotional intelligence levels of students with sensory disability in Amman in Jordan. The participants of the study were 200 students; 140 hearing impaired students and 60 visual impaired students enrolled in the special education schools and centers for the academic year 2016-2017. The study adopted the…
Descriptors: Foreign Countries, Emotional Intelligence, Hearing Impairments, Visual Impairments
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning