Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 16 |
Descriptor
Correlation | 26 |
Test Format | 26 |
Test Items | 11 |
Foreign Countries | 9 |
Scores | 7 |
Multiple Choice Tests | 6 |
Test Validity | 6 |
College Entrance Examinations | 5 |
Comparative Analysis | 5 |
Language Tests | 5 |
Test Use | 5 |
More ▼ |
Source
Author
Allalouf, Avi | 1 |
Bunch, Michael B. | 1 |
Cawthon, Stephanie | 1 |
Cheng, Liying | 1 |
Christensen, Bruce K. | 1 |
Clarke, Rufus | 1 |
Craig, Pippa | 1 |
Dimova, Slobodanka | 1 |
Dunlap, Angel L. | 1 |
Eun, Barohny | 1 |
Gaston, Michele F. | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 26 |
Journal Articles | 20 |
Speeches/Meeting Papers | 4 |
Opinion Papers | 1 |
Reports - Research | 1 |
Education Level
Higher Education | 5 |
Postsecondary Education | 3 |
Elementary Secondary Education | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 3 | 1 |
Primary Education | 1 |
Audience
Practitioners | 1 |
Teachers | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
SAT (College Admission Test) | 3 |
Minnesota Multiphasic… | 2 |
Millon Clinical Multiaxial… | 1 |
Program for International… | 1 |
Remote Associates Test | 1 |
Wechsler Intelligence Scale… | 1 |
Wechsler Intelligence Scales… | 1 |
What Works Clearinghouse Rating
Selcuk Acar; Yuyang Shen – Journal of Creative Behavior, 2025
Creativity tests, like creativity itself, vary widely in their structure and use. These differences include instructions, test duration, environments, prompt and response modalities, and the structure of test items. A key factor is task structure, referring to the specificity of the number of responses requested for a given prompt. Classic…
Descriptors: Creativity, Creative Thinking, Creativity Tests, Task Analysis
Dimova, Slobodanka – Language Teaching Research Quarterly, 2022
Drawing on Glenn Fulcher's extensive work in performance-based language assessment of speaking, this paper explores the assessment of L2 speaking ability in local language testing contexts. For that purpose, I review Fulcher's influential work that highlights the relationship between the speaking construct, the task, the performance, and the…
Descriptors: Language Tests, Speech Communication, Performance Based Assessment, Second Language Learning
Eun, Barohny; Knotek, Steven E. – Research in Education, 2022
A Vygotskian approach to assessment is proposed by invoking the distinction between the development of lower and higher psychological functions. Higher psychological functions are specifically human and develop with the use of cultural tools via mediation. Accordingly, a distinction is made between tests that are based on association, which have…
Descriptors: Evaluation Methods, Sociocultural Patterns, Psychological Patterns, Teaching Methods
Cawthon, Stephanie – American Annals of the Deaf, 2011
Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64…
Descriptors: Language Styles, Test Content, Syntax, Linguistics
Kim, Sooyeon; Walker, Michael E. – Educational Testing Service, 2011
This study examines the use of subpopulation invariance indices to evaluate the appropriateness of using a multiple-choice (MC) item anchor in mixed-format tests, which include both MC and constructed-response (CR) items. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using an MC-only anchor set for 4…
Descriptors: Test Format, Multiple Choice Tests, Test Items, Gender Differences
Pae, Tae-Il – Language Testing, 2012
This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…
Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items
Using a Two-Tier Test in Examining Taiwan Graduate Students' Perspectives on Paraphrasing Strategies
Sun, Yu-Chih – Asia Pacific Education Review, 2009
This study examines Taiwanese English as a foreign language (EFL) graduate students' perspectives on paraphrasing strategies. A two-layer scenario survey was developed to identify the reasoning behind students' judgments that certain paraphrasing is appropriate or inappropriate. The first-layer scenario survey is in a true-false format that…
Descriptors: Graduate Students, Student Attitudes, Foreign Countries, English (Second Language)
Allalouf, Avi; Rapp, Joel; Stoller, Reuven – International Journal of Testing, 2009
When a test is adapted from a source language (SL) into a target language (TL), the two forms are usually not psychometrically equivalent. If linking between test forms is necessary, those items that have had their psychometric characteristics altered by the translation (differential item functioning [DIF] items) should be eliminated from the…
Descriptors: Test Items, Test Format, Verbal Tests, Psychometrics
Girard, Todd A.; Christensen, Bruce K. – Psychological Assessment, 2008
The correlation between a short-form (SF) test and its full-scale (FS) counterpart is a mainstay in the evaluation of SF validity. However, in correcting for overlapping error variance in this measure, investigators have overattenuated the validity coefficient through an intuitive misapplication of P. Levy's (1967) formula. The authors of the…
Descriptors: Error of Measurement, Computation, Psychiatric Services, Correlation
Nehm, Ross H.; Schonfeld, Irvin Sam – Journal of Research in Science Teaching, 2008
Growing recognition of the central importance of fostering an in-depth understanding of natural selection has, surprisingly, failed to stimulate work on the development and rigorous evaluation of instruments that measure knowledge of it. We used three different methodological tools, the Conceptual Inventory of Natural Selection (CINS), a modified…
Descriptors: Evolution, Science Education, Interviews, Measures (Individuals)
Kim, Seonghoon; Kolen, Michael J. – Applied Measurement in Education, 2006
Four item response theory linking methods (2 moment methods and 2 characteristic curve methods) were compared to concurrent (CO) calibration with the focus on the degree of robustness to format effects (FEs) when applying the methods to multidimensional data that reflected the FEs associated with mixed-format tests. Based on the quantification of…
Descriptors: Item Response Theory, Robustness (Statistics), Test Format, Comparative Analysis
Craig, Pippa; Gordon, Jill; Clarke, Rufus; Oldmeadow, Wendy – Assessment & Evaluation in Higher Education, 2009
This study aimed to provide evidence to guide decisions on the type and timing of assessments in a graduate medical programme, by identifying whether students from particular degree backgrounds face greater difficulty in satisfying the current assessment requirements. We examined the performance rank of students in three types of assessments and…
Descriptors: Student Evaluation, Medical Education, Student Characteristics, Correlation
Lundervold, Duane A.; Dunlap, Angel L. – International Journal of Behavioral Consultation and Therapy, 2006
Alternate forms reliability of the Behavioral Relaxation Scale (BRS; Poppen,1998), a direct observation measure of relaxed behavior, was examined. A single BRS score, based on long duration observation (5-minute), has been found to be a valid measure of relaxation and is correlated with self-report and some physiological measures. Recently,…
Descriptors: Test Format, Intervals, Observation, Measures (Individuals)

Schriesheim, Chester A. – Educational and Psychological Measurement, 1981
This study provides support for the hypothesized effect of leniency on the discriminant validity of grouped questionnaire items. It was found that controlling for leniency resulted in a slight decrement in convergent validity but that discriminant validity was substantially improved. Implications for questionnaire validity and further research are…
Descriptors: Classification, Correlation, Questionnaires, Research Problems

Guttman, Louis; Levy, Shlomit – Intelligence, 1991
Two structural laws for intelligence tests are discussed: one law concerns the sign of correlation coefficients and gives conditions under which all correlations between test items will be positive; and one law concerns the relative sizes of the correlation coefficients between intelligence items. A cylindrical structure extends these laws. (SLD)
Descriptors: Correlation, Foreign Countries, Intelligence Tests, Test Construction
Previous Page | Next Page ยป
Pages: 1 | 2