Publication Date
In 2025 | 34 |
Since 2024 | 78 |
Since 2021 (last 5 years) | 173 |
Since 2016 (last 10 years) | 361 |
Since 2006 (last 20 years) | 552 |
Descriptor
Test Validity | 1649 |
Test Reliability | 622 |
Computer Assisted Testing | 400 |
Test Construction | 384 |
Foreign Countries | 348 |
Testing | 333 |
Testing Problems | 322 |
Higher Education | 267 |
Comparative Testing | 216 |
Scores | 206 |
Language Tests | 177 |
More ▼ |
Source
Author
Thompson, Bruce | 8 |
Byrne, Barbara M. | 6 |
Bowman, Harry L. | 5 |
Ling, Guangming | 5 |
McKown, Clark | 5 |
Reckase, Mark D. | 5 |
Weiss, David J. | 5 |
Abedi, Jamal | 4 |
Bulut, Okan | 4 |
Hambleton, Ronald K. | 4 |
Steinberg, Jonathan | 4 |
More ▼ |
Publication Type
Education Level
Location
Canada | 40 |
China | 26 |
Australia | 25 |
California | 21 |
Turkey | 18 |
Germany | 17 |
Indonesia | 17 |
United Kingdom (England) | 15 |
Israel | 14 |
United Kingdom | 14 |
Florida | 11 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gökhan Iskifoglu – Turkish Online Journal of Educational Technology - TOJET, 2024
This research paper investigated the importance of conducting measurement invariance analysis in developing measurement tools for assessing differences between and among study variables. Most of the studies, which tended to develop an inventory to assess the existence of an attitude, behavior, belief, IQ, or an intuition in a person's…
Descriptors: Testing, Testing Problems, Error of Measurement, Attitude Measures
Angela Chamberlain; Emily D'Arcy; Andrew J. O. Whitehouse; Kerry Wallace; Maya Hayden-Evans; Sonya Girdler; Benjamin Milbourn; Sven Bölte; Kiah Evans – Journal of Autism and Developmental Disorders, 2025
Purpose: The PEDI-CAT (ASD) is used to assess functioning of children and youth on the autism spectrum; however, current psychometric evidence is limited. This study aimed to explore the reliability, validity and acceptability of the PEDI-CAT (ASD) using a large Australian sample. Methods: Caregivers of 134 children and youth on the spectrum…
Descriptors: Autism Spectrum Disorders, Children, Youth, Test Reliability
Juan Mendelsohn Ontong; Mareli Rossouw – Cogent Education, 2024
The purpose of this study was to examine the effectiveness of providing extra time as an accommodation to students with learning disabilities (LD) in higher education institutions. The results, which are based in the setting of a South African accountancy programme, provides a unique context where time, in time-constrained assessments, are often…
Descriptors: Foreign Countries, Undergraduate Students, Accounting, Business Administration Education
Jyun-Hong Chen; Hsiu-Yi Chao – Journal of Educational and Behavioral Statistics, 2024
To solve the attenuation paradox in computerized adaptive testing (CAT), this study proposes an item selection method, the integer programming approach based on real-time test data (IPRD), to improve test efficiency. The IPRD method turns information regarding the ability distribution of the population from real-time test data into feasible test…
Descriptors: Data Use, Computer Assisted Testing, Adaptive Testing, Design
Tahereh Firoozi; Hamid Mohammadi; Mark J. Gierl – Journal of Educational Measurement, 2025
The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language-agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were…
Descriptors: College Students, Slavic Languages, German, Italian
Jeff Allen; Jay Thomas; Stacy Dreyer; Scott Johanningmeier; Dana Murano; Ty Cruce; Xin Li; Edgar Sanchez – ACT Education Corp., 2025
This report describes the process of developing and validating the enhanced ACT. The report describes the changes made to the test content and the processes by which these design decisions were implemented. The authors describe how they shared the overall scope of the enhancements, including the initial blueprints, with external expert panels,…
Descriptors: College Entrance Examinations, Testing, Change, Test Construction
Luis Felipe Dias Lopes; Fabiane Volpato Chiapinoto; Martiele Gonçalves Moreira; Nuvea Kuhn; Fillipe Grando Lopes; Luciana Davi Traverso; Deoclécio Junior Cardoso Silva; Gilnei Luiz de Moura – Journal of Education and Learning, 2024
This study aimed to validate a scale for subjectively measuring teaching competencies for innovation in higher education. The scale was developed by creating a set of items that underwent content validity through the Delphi technique and face validity. A survey was then conducted with 523 higher education professors. The resulting scale, called…
Descriptors: Foreign Countries, College Faculty, Teacher Competencies, Teacher Competency Testing
Musa Adekunle Ayanwale; Mdutshekelwa Ndlovu – Journal of Pedagogical Research, 2024
The COVID-19 pandemic has had a significant impact on high-stakes testing, including the national benchmark tests in South Africa. Current linear testing formats have been criticized for their limitations, leading to a shift towards Computerized Adaptive Testing [CAT]. Assessments with CAT are more precise and take less time. Evaluation of CAT…
Descriptors: Adaptive Testing, Benchmarking, National Competency Tests, Computer Assisted Testing
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Meagan Karvonen; Russell Swinburne Romine; Amy K. Clark – Practical Assessment, Research & Evaluation, 2024
This paper describes methods and findings from student cognitive labs, teacher cognitive labs, and test administration observations as evidence evaluated in a validity argument for a computer-based alternate assessment for students with significant cognitive disabilities. Validity of score interpretations and uses for alternate assessments based…
Descriptors: Students with Disabilities, Intellectual Disability, Severe Disabilities, Student Evaluation
Süleyman Demir; Derya Çobanoglu Aktan; Nese Güler – International Journal of Assessment Tools in Education, 2023
This study has two main purposes. Firstly, to compare the different item selection methods and stopping rules used in Computerized Adaptive Testing (CAT) applications with simulative data generated based on the item parameters of the Vocational Maturity Scale. Secondly, to test the validity of CAT application scores. For the first purpose,…
Descriptors: Computer Assisted Testing, Adaptive Testing, Vocational Maturity, Measures (Individuals)
Ke-Hai Yuan; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
Mediation analysis plays an important role in understanding causal processes in social and behavioral sciences. While path analysis with composite scores was criticized to yield biased parameter estimates when variables contain measurement errors, recent literature has pointed out that the population values of parameters of latent-variable models…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Comparative Testing
Bernis Sütçübasi; Tugçe Balli; Herbert Roeyers; Jan R. Wiersema; Sami Çamkerten; Ozan Cem Öztürk; Baris Metin; Edmund Sonuga-Barke – Journal of Attention Disorders, 2025
Objective: ADHD and autism are complex and frequently co-occurring neurodevelopmental conditions with shared etiological and pathophysiological elements. In this paper, we attempt to differentiate these conditions among the young people in terms of intrinsic patterns of brain connectivity revealed during resting state using machine learning…
Descriptors: Elementary School Students, Secondary School Students, Attention Deficit Hyperactivity Disorder, Autism Spectrum Disorders
Kayla V. Campaña; Benjamin G. Solomon – Assessment for Effective Intervention, 2025
The purpose of this study was to compare the classification accuracy of data produced by the previous year's end-of-year New York state assessment, a computer-adaptive diagnostic assessment ("i-Ready"), and the gating combination of both assessments to predict the rate of students passing the following year's end-of-year state assessment…
Descriptors: Accuracy, Classification, Diagnostic Tests, Adaptive Testing
K. Talman; J. Vierula; T. Karihtala; E. Laakkonen; J. Engblom; E. Haavisto – Higher Education Quarterly, 2025
Higher education institutions need to develop valid, fair, and objective selection methods. Current literature reporting the development and validation of new national large-scale selection tests is scarce. This two-phased study aimed to (1) develop and (2) evaluate the validity of the Finnish digital Universities of Applied Sciences Entrance…
Descriptors: Admission Criteria, Test Construction, Test Validity, Computer Assisted Testing