Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 9 |
Descriptor
Construct Validity | 25 |
Standardized Tests | 25 |
Test Validity | 25 |
Test Reliability | 9 |
Language Tests | 8 |
English (Second Language) | 7 |
Test Construction | 7 |
Evaluation Methods | 5 |
Higher Education | 5 |
Second Language Instruction | 5 |
Second Language Learning | 5 |
More ▼ |
Source
Author
Nakamura, Yuji | 2 |
Ackerman, Terry A. | 1 |
Ahn, Jieun Irene | 1 |
Arnault, Lynne S. | 1 |
Banta, Trudy W. | 1 |
Bartoli, Eleonora | 1 |
Bellamy, Scarlett | 1 |
Bertsch, Kristin N. | 1 |
Briggs, Derek C. | 1 |
Chang, Hyung-ji | 1 |
Cheng, Britte H. | 1 |
More ▼ |
Publication Type
Journal Articles | 19 |
Reports - Evaluative | 12 |
Reports - Research | 12 |
Speeches/Meeting Papers | 3 |
Opinion Papers | 2 |
Dissertations/Theses -… | 1 |
Education Level
Audience
Researchers | 3 |
Location
Japan | 1 |
New York | 1 |
South Korea | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Bayley Scales of Infant… | 1 |
California Psychological… | 1 |
Scales of Independent Behavior | 1 |
Test of English as a Foreign… | 1 |
Test of English for… | 1 |
Test of Written English | 1 |
What Works Clearinghouse Rating
Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett – Journal of Multicultural Counseling and Development, 2016
The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…
Descriptors: Counseling Techniques, Cultural Relevance, Counselor Qualifications, Expertise
Foghahaee, Zahra – Language Teaching Research Quarterly, 2019
Reverse engineering (RE) can play an important role in the re-designing tests in L2 English. It can also enrich the aim of teaching the same as raising children through academic achievement. In addition, it can play a key role in helping students understand how much their test is valid by using Standard reverse engineering (SRE). This paper is a…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, English (Second Language)
Winke, Paula; Lee, Shinhye; Ahn, Jieun Irene; Choi, Ina; Cui, Yaqiong; Yoon, Hyung-Jo – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2018
This study investigated the cognitive validity of two child English language tests. Some teachers maintain that these types of tests may be cognitively invalid because native-English-speaking children would not do well on them (Winke, 2011). So the researchers had native speakers and learners of English aged 7 to 9 take sample versions of two…
Descriptors: Language Tests, English, English (Second Language), Second Language Learning
Kang, Mun-koo; Chang, Hyung-ji – Journal of Pan-Pacific Association of Applied Linguistics, 2014
This study is aimed at ensuring the validity of the Practical English Certification Test (PECT) of the Chung-nam Office of Education (COE) in Korea. Motivated by the demand for a developing localized English test to empower English learning in public education, the COE conducted the PECT for 38,544 students of elementary, middle and high schools…
Descriptors: Foreign Countries, English (Second Language), Second Language Learning, Language Tests
Pentimonti, Jill M.; Zucker, Tricia A.; Justice, Laura M.; Petscher, Yaacov; Piasta, Shayne B.; Kaderavek, Joan N. – Early Childhood Research Quarterly, 2012
Participation in shared-reading experiences is associated with children's language and literacy outcomes, yet few standardized assessments of shared-reading quality exist. The purpose of this study was to describe the psychometric characteristics of the Systematic Assessment of Book Reading (SABR), an observational tool designed to characterize…
Descriptors: Test Validity, Construct Validity, Interrater Reliability, Factor Structure
Mislevy, Robert J.; Haertel, Geneva; Cheng, Britte H.; Ructtinger, Liliana; DeBarger, Angela; Murray, Elizabeth; Rose, David; Gravel, Jenna; Colker, Alexis M.; Rutstein, Daisy; Vendlinski, Terry – Educational Research and Evaluation, 2013
Standardizing aspects of assessments has long been recognized as a tactic to help make evaluations of examinees fair. It reduces variation in irrelevant aspects of testing procedures that could advantage some examinees and disadvantage others. However, recent attention to making assessment accessible to a more diverse population of students…
Descriptors: Testing Accommodations, Access to Education, Testing, Psychometrics
Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012
The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…
Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
Briggs, Derek C. – Educational Researcher, 2008
When causal inferences are to be synthesized across multiple studies, efforts to establish the magnitude of a causal effect should be balanced by an effort to evaluate the generalizability of the effect. The evaluation of generalizability depends on two factors that are given little attention in current syntheses: construct validity and external…
Descriptors: Test Validity, Construct Validity, Inferences, Educational Policy
Itomitsu, Masayuki – ProQuest LLC, 2009
This dissertation reports development and validation studies of a Web-based standardized test of Japanese as a foreign language (JFL), designed to measure learners' off-line grammatical and pragmatic knowledge in multiple-choice format. Targeting Japanese majors in the U.S. universities and colleges, the test is designed to explore possible…
Descriptors: Sentences, Speech Acts, Grammar, Second Language Learning

Pike, Gary R. – Review of Higher Education, 1989
A study investigated the appropriateness of the American College Testing Program's College Outcome Measures Program, conducted at the University of Tennessee, Knoxville, by applying the criterion of construct validity. Results indicated that while the test primarily measures individual differences, it is also sensitive to the effects of higher…
Descriptors: Construct Validity, Educational Quality, Evaluation Criteria, Higher Education
Roemer, Ann – College and University, 2002
Describes the Test of English as a Foreign Language (TOEFL) and the Advanced Placement in International English Language (APIEL) and evaluates both tests on three basic types of validity criteria: content, construct, and criterion-related. Concludes that the TOEFL has serious limitations, and that the APIEL may be more useful. (EV)
Descriptors: Construct Validity, Content Validity, English (Second Language), Foreign Students
Li, Yuan H.; Tompkins, Leroy J. – International Journal of Testing, 2004
The primary objective of this study was to examine the construct validity for the 2 multiple-content testing programs-the multiple-choice Comprehensive Tests of Basic Skills (CTBS/5) together with the performance-based Maryland School Performance Assessment Program (MSPAP)-by evaluating the true-score longitudinal associations among…
Descriptors: Testing Programs, Structural Equation Models, Performance Based Assessment, Multitrait Multimethod Techniques
Nolet, Victor; Tindal, Gerald – 1990
Valid interpretation of test scores is the shared responsibility of the test designer and the test user. Test publishers must provide evidence of the validity of the decisions their tests are intended to support, while test users are responsible for analyzing this evidence and subsequently using the test in the manner indicated by the publisher.…
Descriptors: Achievement Tests, Construct Validity, Elementary Secondary Education, Norm Referenced Tests
Snyder, Scott; Sheehan, Robert – Diagnostique, 1992
Rasch calibration procedures were applied to item-response data for the 1,262 infants and toddlers comprising the standardization sample for the Mental Scale of the Bayley Scales of Infant Development. Analyses tend to confirm the psychometric integrity of the instrument. (Author)
Descriptors: Child Development, Cognitive Tests, Concurrent Validity, Construct Validity
Walker, David W.; Arnault, Lynne S. – Diagnostique, 1991
This study examined the construct validity of the KeyMath-Revised by testing the factorial model proposed by the test author. Results failed to confirm the proposed factorial model and suggested that the KeyMath-Revised assesses two domains that are difficult to interpret, rather than the three proposed by the test author. (Author/JDD)
Descriptors: Achievement Tests, Construct Validity, Diagnostic Tests, Elementary Secondary Education
Previous Page | Next Page ยป
Pages: 1 | 2