Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 13 |
Descriptor
Foreign Countries | 18 |
Test Length | 18 |
Test Reliability | 18 |
Test Validity | 9 |
Test Construction | 5 |
Comparative Analysis | 4 |
Computer Assisted Testing | 4 |
Difficulty Level | 4 |
Correlation | 3 |
Factor Analysis | 3 |
High Stakes Tests | 3 |
More ▼ |
Source
Author
Almeida, Leandro S. | 1 |
Arens, A. Katrin | 1 |
Bechger, Timo M. | 1 |
Benton, Tom | 1 |
Bulut, Okan | 1 |
Casanova, Joana R. | 1 |
Catts, Ralph | 1 |
Chen, Hsueh-Chu | 1 |
Chia-Ying Chu | 1 |
Chieh-An Chen | 1 |
Craven, Rhonda G. | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Research | 14 |
Reports - Descriptive | 3 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Secondary Education | 3 |
Elementary Education | 2 |
Early Childhood Education | 1 |
Elementary Secondary Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
More ▼ |
Audience
Location
China | 4 |
Turkey | 3 |
Australia | 2 |
Canada | 2 |
Ireland | 2 |
Netherlands | 2 |
Singapore | 2 |
United Kingdom | 2 |
Germany | 1 |
Japan | 1 |
Kenya | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Fennema Sherman Mathematics… | 1 |
National Assessment of… | 1 |
Self Description Questionnaire | 1 |
What Works Clearinghouse Rating
Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024
This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…
Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition
Tom Benton – Research Matters, 2024
Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…
Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Benton, Tom – Research Matters, 2021
Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…
Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level
Casanova, Joana R.; Almeida, Leandro S.; Peixoto, Francisco; Ribeiro, Rui-Bártolo; Marôco, João – SAGE Open, 2019
Academic expectations play a significant role in the quality of student adaptation and academic success. Previous research suggests that expectations are a multidimensional construct, making it crucial to test the measures used for this important characteristic. Because assessment of student adaptation to higher education comprises a multitude of…
Descriptors: Foreign Countries, College Freshmen, Questionnaires, Expectation
Feher, Anita; Smith, Martin M.; Saklofske, Donald H.; Plouffe, Rachel A.; Wilson, Claire A.; Sherry, Simon B. – Journal of Psychoeducational Assessment, 2020
The Big Three Perfectionism Scale (BTPS) is a 45-item self-report measure of perfectionism with three overarching factors: rigid, self-critical, and narcissistic perfectionism. Our objective was to create a brief version of the BTPS, the Big Three Perfectionism Scale--Short Form (BTPS-SF). Sixteen items were selected, and confirmatory factor…
Descriptors: Personality Measures, Personality Traits, Test Construction, Measurement Techniques
Senel, Selma; Kutlu, Ömer – European Journal of Special Needs Education, 2018
This paper examines listening comprehension skills of visually impaired students (VIS) using computerised adaptive testing (CAT) and reader-assisted paper-pencil testing (raPPT) and student views about them. Explanatory mixed method design was used in this study. Sample is comprised of 51 VIS, in 7th and 8th grades. 9 of these students were…
Descriptors: Computer Assisted Testing, Adaptive Testing, Visual Impairments, Student Attitudes
Kinyua, Kiragu; Okunya, Luke Odiemo – African Educational Research Journal, 2014
This study was carried out to establish the factors influencing the validity and reliability of teacher made tests in Kenya. It was conducted in Nyahururu District of Laikipia County in Kenya. The study involved 42 teachers and 15 key informants selected from teachers holding various positions of academic responsibilities in their schools in…
Descriptors: Tests, Test Validity, Test Reliability, Physics
Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015
Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…
Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish
Yang, Sophie Xin; Jowett, Sophia – Measurement in Physical Education and Exercise Science, 2013
The Coach-Athlete Relationship Questionnaire was developed to effectively measure affective, cognitive, and behavioral aspects, represented by the interpersonal constructs of closeness, commitment, and complementarity, of the quality of the relationship within the context of sport coaching. The current study sought to determine the internal…
Descriptors: Foreign Countries, Athletes, Athletic Coaches, Interpersonal Relationship
Bechger, Timo M.; Maris, Gunter; Hsiao, Ya Ping – Applied Psychological Measurement, 2010
The main purpose of this article is to demonstrate how halo effects may be detected and quantified using two independent ratings of the same person. A practical illustration is given to show how halo effects can be avoided. (Contains 2 tables, 7 figures, and 2 notes.)
Descriptors: Performance Based Assessment, Test Reliability, Test Length, Language Tests
Arens, A. Katrin; Yeung, Alexander Seeshing; Craven, Rhonda G.; Hasselhorn, Marcus – International Journal of Research & Method in Education, 2013
This study aims to develop a short German version of the Self Description Questionnaire (SDQ I-GS) in order to present a robust economical instrument for measuring German preadolescents' multidimensional self-concept. A full German version of the SDQ I (SDQ I-G) that maintained the original structure and thus length of the English original SDQ I…
Descriptors: Foreign Countries, Questionnaires, Test Construction, Test Length
Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012
Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…
Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students

Mulhern, Fiona; Rae, Gordon – Educational and Psychological Measurement, 1998
Data from 196 Irish school children were analyzed and used to develop a shortened version of the Fennema-Sherman Mathematics Attitudes Scales (E. Fennema and J. Sherman, 1976). Internal consistency estimates of the reliability of scores on the whole scale and each of the subscales of the original and short form were favorable. (SLD)
Descriptors: Attitude Measures, Elementary Education, Elementary School Students, Foreign Countries
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Previous Page | Next Page »
Pages: 1 | 2