Publication Date
| In 2026 | 0 |
| Since 2025 | 10 |
| Since 2022 (last 5 years) | 54 |
| Since 2017 (last 10 years) | 97 |
| Since 2007 (last 20 years) | 163 |
Descriptor
| Test Format | 506 |
| Test Validity | 506 |
| Test Reliability | 243 |
| Test Construction | 180 |
| Test Items | 127 |
| Foreign Countries | 108 |
| Language Tests | 96 |
| Higher Education | 86 |
| Testing | 80 |
| Computer Assisted Testing | 72 |
| Test Use | 67 |
| More ▼ | |
Source
Author
Publication Type
Education Level
| Higher Education | 60 |
| Postsecondary Education | 50 |
| Secondary Education | 30 |
| Elementary Education | 25 |
| Middle Schools | 19 |
| Junior High Schools | 15 |
| High Schools | 13 |
| Grade 8 | 11 |
| Grade 4 | 9 |
| Elementary Secondary Education | 8 |
| Grade 5 | 8 |
| More ▼ | |
Audience
| Practitioners | 30 |
| Teachers | 19 |
| Administrators | 17 |
| Researchers | 9 |
| Community | 1 |
| Policymakers | 1 |
| Students | 1 |
| Support Staff | 1 |
Location
| Canada | 10 |
| China | 9 |
| New York | 9 |
| Japan | 7 |
| Netherlands | 6 |
| Germany | 5 |
| Turkey | 5 |
| United Kingdom | 5 |
| United Kingdom (England) | 5 |
| Australia | 4 |
| Georgia | 4 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
| Individuals with Disabilities… | 1 |
| Job Training Partnership Act… | 1 |
| No Child Left Behind Act 2001 | 1 |
| Pell Grant Program | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Hassan, Nurul Huda; Shih, Chih-Min – Language Assessment Quarterly, 2013
This article describes and reviews the Singapore-Cambridge General Certificate of Education Advanced Level General Paper (GP) examination. As a written test that is administered to preuniversity students, the GP examination is internationally recognised and accepted by universities and employers as proof of English competence. In this article, the…
Descriptors: Foreign Countries, College Entrance Examinations, English (Second Language), Writing Tests
Read, John; von Randow, Janet – International Journal of English Studies, 2013
The increasingly diverse language backgrounds of their students are creating new challenges for English-medium universities. One response in Australian and New Zealand institutions has been to introduce post-entry language assessment (PELA) to identify incoming students who need to enhance their academic language ability. One successful example of…
Descriptors: Foreign Countries, Language Tests, Academic Discourse, Diagnostic Tests
Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012
Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…
Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement
Sasaki, Miyuki – Language Testing, 2012
The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…
Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics
Du, Li – English Language Teaching, 2010
The paper analyzes the variables that can influence the validity of the reading part in the standardized test of CET (College English Test). They refer to the test methods and test content. As for the test method, the widely used are multiple-choice questions and short answer questions which can influence the test takers' performance and therefore…
Descriptors: Reading Tests, Test Validity, Predictor Variables, Performance Factors
Cheng, Liying; DeLuca, Christopher – Educational Assessment, 2011
Test-takers' interpretations of validity as related to test constructs and test use have been widely debated in large-scale language assessment. This study contributes further evidence to this debate by examining 59 test-takers' perspectives in writing large-scale English language tests. Participants wrote about their test-taking experiences in…
Descriptors: Language Tests, Test Validity, Test Use, English
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Kalaycioglu, Dilara Bakan; Berberoglu, Giray – Journal of Psychoeducational Assessment, 2011
This study is aimed to detect differential item functioning (DIF) items across gender groups, analyze item content for the possible sources of DIF, and eventually investigate the effect of DIF items on the criterion-related validity of the test scores in the quantitative section of the university entrance examination (UEE) in Turkey. The reason…
Descriptors: Test Bias, College Entrance Examinations, Item Analysis, Test Items
Ediger, Marlow – Education, 2010
Data driven decision making emphasizes the importance of the teacher using objective sources of information in developing the social studies curriculum. Too frequently, decisions of teachers have been made based on routine and outdated methods of teaching. Valid and reliable tests used to secure results from pupil learning make for better…
Descriptors: Data, Decision Making, Social Studies, Standardized Tests
Bae, Jungok; Lee, Yae-Sheik – Language Testing, 2011
Pictures are widely used to elicit expressive language skills, and pictures must be established as parallel before changes in ability can be demonstrated by assessment using pictures prompts. Why parallel prompts are required and what it is necessary to do to ensure that prompts are in fact parallel is not widely known. To date, evidence of…
Descriptors: Second Language Learning, Test Format, Language Tests, Factor Analysis
Alemi, Minoo; Miraghaee, Apama – Journal on English Language Teaching, 2011
The present study was carried out to find out whether regular administration of cloze test improved the students' knowledge of grammar more than the multiple choice one. Subjects participating in this study were 84 Iranian pre-university students of Allameh-Gotb-e Ravandi University, aged between 18 and 35 and enrolled in a grammar course. To…
Descriptors: Foreign Countries, Comparative Analysis, Grammar, Knowledge Level
Jin, Yan – Journal of Pan-Pacific Association of Applied Linguistics, 2011
The College English Test (CET) is an English language test designed for educational purposes, administered on a very large scale, and used for making high-stakes decisions. This paper discusses the key issues facing the CET during the course of its development in the past two decades. It argues that the most fundamental and critical concerns of…
Descriptors: High Stakes Tests, Language Tests, Measures (Individuals), Graduates
Roberts, William L.; McKinley, Danette W.; Boulet, John R. – Advances in Health Sciences Education, 2010
Due to the high-stakes nature of medical exams it is prudent for test agencies to critically evaluate test data and control for potential threats to validity. For the typical multiple station performance assessments used in medicine, it may take time for examinees to become comfortable with the test format and administrative protocol. Since each…
Descriptors: Student Evaluation, Pretests Posttests, Licensing Examinations (Professions), Scores
McGaw, Barry – Assessment in Education: Principles, Policy & Practice, 2008
In their reactions to my paper, the four authors provide comments that are illuminating and helpful for continuing discussions of the nature and utility of quantitative, comparative, international studies of educational achievement. In this response, I comment further on the issues of test characteristics, sample design, culture and causation.
Descriptors: Test Format, International Studies, Academic Achievement, Evaluation
Hendrickson, Amy; Patterson, Brian; Ewing, Maureen – College Board, 2010
The psychometric considerations and challenges associated with including constructed response items on tests are discussed along with how these issues affect the form assembly specifications for mixed-format exams. Reliability and validity, security and fairness, pretesting, content and skills coverage, test length and timing, weights, statistical…
Descriptors: Multiple Choice Tests, Test Format, Test Construction, Test Validity

Peer reviewed
Direct link
