ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	13

Descriptor

Foreign Countries	18
Test Length	18
Test Reliability	18
Test Validity	9
Test Construction	5
Comparative Analysis	4
Computer Assisted Testing	4
Difficulty Level	4
Correlation	3
Factor Analysis	3
High Stakes Tests	3
Item Response Theory	3
Language Tests	3
Questionnaires	3
Adaptive Testing	2
Elementary School Students	2
Evaluation Criteria	2
Factor Structure	2
Goodness of Fit	2
Listening Comprehension Tests	2
Psychometrics	2
Scores	2
Secondary School Teachers	2
Statistical Analysis	2
Test Format	2
More ▼

Source

Educational and Psychological…	2
Research Matters	2
African Educational Research…	1
Applied Psychological…	1
Eurasian Journal of…	1
European Journal of Special…	1
International Journal of…	1
Journal of Deaf Studies and…	1
Journal of Psychoeducational…	1
Language Testing	1
Measurement in Physical…	1
Online Submission	1
SAGE Open	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	14
Reports - Descriptive	3
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Higher Education	3
Postsecondary Education	3
Secondary Education	3
Elementary Education	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Location

China	4
Turkey	3
Australia	2
Canada	2
Ireland	2
Netherlands	2
Singapore	2
United Kingdom	2
Germany	1
Japan	1
Kenya	1
New Zealand	1
Poland	1
Portugal	1
South Korea	1
Taiwan	1
United Kingdom (England)	1
United States	1
Vermont	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Fennema Sherman Mathematics…	1
National Assessment of…	1
Self Description Questionnaire	1

What Works Clearinghouse Rating

Showing 1 to 15 of 18 results Save | Export

Effect of Sample Length on MLU in Mandarin-Speaking Hard-of-Hearing Children

Peer reviewed

Direct link

Chia-Ying Chu; Pei-Hua Chen; Yi-Shin Tsai; Chieh-An Chen; Yi-Chih Chan; Yan-Jhe Ciou – Journal of Deaf Studies and Deaf Education, 2024

This study investigated the impact of language sample length on mean length of utterance (MLU) and aimed to determine the minimum number of utterances required for a reliable MLU. Conversations were collected from Mandarin-speaking, hard-of-hearing and typical-hearing children aged 16-81 months. The MLUs were calculated using sample sizes ranging…

Descriptors: Foreign Countries, Mandarin Chinese, Young Children, Language Acquisition

How Long Should a High Stakes Test Be?

Download full text

Tom Benton – Research Matters, 2024

Educational assessment is used throughout the world for a range of different formative and summative purposes. Wherever an assessment is developed, whether by a teacher creating a quiz for their class, or by a testing company creating a high stakes assessment, it is necessary to decide how long the test should be. Specifically, how many questions…

Descriptors: Foreign Countries, High Stakes Tests, Test Length, Test Construction

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Item Response Theory, Computer Adaptive Testing and the Risk of Self-Deception

Download full text

Benton, Tom – Research Matters, 2021

Computer adaptive testing is intended to make assessment more reliable by tailoring the difficulty of the questions a student has to answer to their level of ability. Most commonly, this benefit is used to justify the length of tests being shortened whilst retaining the reliability of a longer, non-adaptive test. Improvements due to adaptive…

Descriptors: Risk, Item Response Theory, Computer Assisted Testing, Difficulty Level

Academic Expectations Questionnaire: A Proposal for a Short Version

Peer reviewed

Direct link

Casanova, Joana R.; Almeida, Leandro S.; Peixoto, Francisco; Ribeiro, Rui-Bártolo; Marôco, João – SAGE Open, 2019

Academic expectations play a significant role in the quality of student adaptation and academic success. Previous research suggests that expectations are a multidimensional construct, making it crucial to test the measures used for this important characteristic. Because assessment of student adaptation to higher education comprises a multitude of…

Descriptors: Foreign Countries, College Freshmen, Questionnaires, Expectation

The Big Three Perfectionism Scale--Short Form (BTPS-SF): Development of a Brief Self-Report Measure of Multidimensional Perfectionism

Peer reviewed

Direct link

Feher, Anita; Smith, Martin M.; Saklofske, Donald H.; Plouffe, Rachel A.; Wilson, Claire A.; Sherry, Simon B. – Journal of Psychoeducational Assessment, 2020

The Big Three Perfectionism Scale (BTPS) is a 45-item self-report measure of perfectionism with three overarching factors: rigid, self-critical, and narcissistic perfectionism. Our objective was to create a brief version of the BTPS, the Big Three Perfectionism Scale--Short Form (BTPS-SF). Sixteen items were selected, and confirmatory factor…

Descriptors: Personality Measures, Personality Traits, Test Construction, Measurement Techniques

Comparison of Two Test Methods for VIS: Paper-Pencil Test and CAT

Peer reviewed

Direct link

Senel, Selma; Kutlu, Ömer – European Journal of Special Needs Education, 2018

This paper examines listening comprehension skills of visually impaired students (VIS) using computerised adaptive testing (CAT) and reader-assisted paper-pencil testing (raPPT) and student views about them. Explanatory mixed method design was used in this study. Sample is comprised of 51 VIS, in 7th and 8th grades. 9 of these students were…

Descriptors: Computer Assisted Testing, Adaptive Testing, Visual Impairments, Student Attitudes

Validity and Reliability of Teacher-Made Tests: Case Study of Year 11 Physics in Nyahururu District of Kenya

Peer reviewed
PDF on ERIC

Download full text

Kinyua, Kiragu; Okunya, Luke Odiemo – African Educational Research Journal, 2014

This study was carried out to establish the factors influencing the validity and reliability of teacher made tests in Kenya. It was conducted in Nyahururu District of Laikipia County in Kenya. The study involved 42 teachers and 15 key informants selected from teachers holding various positions of academic responsibilities in their schools in…

Descriptors: Tests, Test Validity, Test Reliability, Physics

Indexing Creativity Fostering Teacher Behaviour: Replication and Modification

Download full text

Dikici, Ayhan; Soh, Kaycheng – Online Submission, 2015

Many measurement tools on creativity are available in the literature. One of these scales is Creativity Fostering Teacher Behaviour Index (CFTIndex) developed for Singaporean teacher originally. It was then translated into Turkish and trialled on teachers in Nigde province with acceptable reliability and factorial validity. The main purpose of…

Descriptors: Creativity, Teacher Behavior, Comparative Analysis, Turkish

The Psychometric Properties of the Short and Long Versions of the Coach-Athlete Relationship Questionnaire

Peer reviewed

Direct link

Yang, Sophie Xin; Jowett, Sophia – Measurement in Physical Education and Exercise Science, 2013

The Coach-Athlete Relationship Questionnaire was developed to effectively measure affective, cognitive, and behavioral aspects, represented by the interpersonal constructs of closeness, commitment, and complementarity, of the quality of the relationship within the context of sport coaching. The current study sought to determine the internal…

Descriptors: Foreign Countries, Athletes, Athletic Coaches, Interpersonal Relationship

Detecting Halo Effects in Performance-Based Examinations

Peer reviewed

Direct link

Bechger, Timo M.; Maris, Gunter; Hsiao, Ya Ping – Applied Psychological Measurement, 2010

The main purpose of this article is to demonstrate how halo effects may be detected and quantified using two independent ratings of the same person. A practical illustration is given to show how halo effects can be avoided. (Contains 2 tables, 7 figures, and 2 notes.)

Descriptors: Performance Based Assessment, Test Reliability, Test Length, Language Tests

A Short German Version of the Self Description Questionnaire I: Theoretical and Empirical Comparability

Peer reviewed

Direct link

Arens, A. Katrin; Yeung, Alexander Seeshing; Craven, Rhonda G.; Hasselhorn, Marcus – International Journal of Research & Method in Education, 2013

This study aims to develop a short German version of the Self Description Questionnaire (SDQ I-GS) in order to present a robust economical instrument for measuring German preadolescents' multidimensional self-concept. A full German version of the SDQ I (SDQ I-G) that maintained the original structure and thus length of the English original SDQ I…

Descriptors: Foreign Countries, Questionnaires, Test Construction, Test Length

Application of Computerized Adaptive Testing to Entrance Examination for Graduate Studies in Turkey

Peer reviewed
PDF on ERIC

Download full text

Bulut, Okan; Kan, Adnan – Eurasian Journal of Educational Research, 2012

Problem Statement: Computerized adaptive testing (CAT) is a sophisticated and efficient way of delivering examinations. In CAT, items for each examinee are selected from an item bank based on the examinee's responses to the items. In this way, the difficulty level of the test is adjusted based on the examinee's ability level. Instead of…

Descriptors: Adaptive Testing, Computer Assisted Testing, College Entrance Examinations, Graduate Students

Development of a Shortened Form of the Fennema-Sherman Mathematics Attitudes Scales.

Peer reviewed

Mulhern, Fiona; Rae, Gordon – Educational and Psychological Measurement, 1998

Data from 196 Irish school children were analyzed and used to develop a shortened version of the Fennema-Sherman Mathematics Attitudes Scales (E. Fennema and J. Sherman, 1976). Internal consistency estimates of the reliability of scores on the whole scale and each of the subscales of the original and short form were favorable. (SLD)

Descriptors: Attitude Measures, Elementary Education, Elementary School Students, Foreign Countries

The Standardized Mean Difference within the Framework of Item Response Theory

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004

As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…

Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement

Previous Page | Next Page »

Pages: 1 | 2

Almeida, Leandro S.	1
Arens, A. Katrin	1
Bechger, Timo M.	1
Benton, Tom	1
Bulut, Okan	1
Casanova, Joana R.	1
Catts, Ralph	1
Chen, Hsueh-Chu	1
Chia-Ying Chu	1
Chieh-An Chen	1
Craven, Rhonda G.	1
Dikici, Ayhan	1
Feher, Anita	1
Freedman, Sarah Warshauer	1
Hasselhorn, Marcus	1
Hsiao, Ya Ping	1
Jin Chen	1
Jowett, Sophia	1
Kan, Adnan	1
Kinyua, Kiragu	1
Kutlu, Ömer	1
Maris, Gunter	1
Marôco, João	1
Mulhern, Fiona	1
Okunya, Luke Odiemo	1
More ▼