Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 124 |
Descriptor
Statistical Analysis | 194 |
Validity | 112 |
Test Validity | 54 |
Reliability | 43 |
Measures (Individuals) | 34 |
Foreign Countries | 33 |
Research Methodology | 29 |
Evaluation Methods | 28 |
Models | 27 |
Scores | 27 |
Test Reliability | 26 |
More ▼ |
Source
Author
Publication Type
Education Level
Higher Education | 32 |
Elementary Secondary Education | 20 |
Postsecondary Education | 17 |
Elementary Education | 13 |
High Schools | 9 |
Secondary Education | 9 |
Grade 8 | 7 |
Grade 3 | 6 |
Grade 7 | 6 |
Grade 1 | 4 |
Grade 2 | 4 |
More ▼ |
Location
Australia | 5 |
United Kingdom | 4 |
United States | 4 |
Florida | 3 |
Michigan | 3 |
Netherlands | 3 |
Wisconsin | 3 |
California | 2 |
Canada (Toronto) | 2 |
China | 2 |
Germany | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Debra P v Turlington | 1 |
Race to the Top | 1 |
Safe and Drug Free Schools… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Benton, Tom – Research Matters, 2020
This article reviews the evidence on the extent to which experts' perceptions of item difficulties, captured using comparative judgement, can predict empirical item difficulties. This evidence is drawn from existing published studies on this topic and also from statistical analysis of data held by Cambridge Assessment. Having reviewed the…
Descriptors: Test Items, Difficulty Level, Expertise, Comparative Analysis
Krenzke, Tom; Mohadjer, Leyla; Li, Jianzhu; Erciulescu, Andreea; Fay, Robert; Ren, Weijia; Van de Kerckhove, Wendy; Li, Lin; Rao, J. N. K. – National Center for Education Statistics, 2020
The Program for the International Assessment of Adult Competencies (PIAAC) is a multicycle survey of adult skills and competencies sponsored by the Organization for Economic Cooperation and Development (OECD). The survey examines a range of basic skills in the information age and assesses these adult skills consistently across participating…
Descriptors: Adults, Surveys, Statistical Analysis, Computation
Benton, Stephen L.; Li, Dan – IDEA Center, Inc., 2019
Periodically, articles reporting research on student ratings of instruction (SRI), aka student evaluations of teaching, appear in the higher-education press. This literature often summarizes studies that challenge the validity and reliability of SRI. However, before drawing a conclusion about a quantitative study touted in the media, readers…
Descriptors: Credibility, Student Evaluation of Teacher Performance, Statistical Analysis, Evaluation Criteria
Barnow, Burt S.; Greenberg, David H. – American Journal of Evaluation, 2020
This paper reviews the use of multiple trials, defined as multiple sites or multiple arms in a single evaluation and replications, in evaluating social programs. After defining key terms, the paper discusses the rationales for conducting multiple trials, which include increasing sample size to increase statistical power; identifying the most…
Descriptors: Evaluation, Randomized Controlled Trials, Experiments, Replication (Evaluation)
Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
Wang, Jue; Engelhard, George, Jr. – Measurement: Interdisciplinary Research and Perspectives, 2016
The authors of the focus article describe an important issue related to the use and interpretation of causal indicators within the context of structural equation modeling (SEM). In the focus article, the authors illustrate with simulated data the effects of omitting a causal indicator. Since SEMs are used extensively in the social and behavioral…
Descriptors: Structural Equation Models, Measurement, Causal Models, Construct Validity
Holland, Charlotte; Lorenzi, Francesca; Hall, Tony – Policy Futures in Education, 2016
The current recessionary economic climate in Ireland has (re-) awakened a neoliberal agenda that is changing the dynamic of what is being valued within research assessment exercises, specifically across Arts, Humanities and Social Sciences (AHSS) disciplines in higher education. Research assessment exercises in AHSS disciplines now place a greater…
Descriptors: Foreign Countries, Anxiety, Higher Education, Performance
Förster, Manuel; Happ, Roland; Molerov, Dimitar – Journal of Economic Education, 2017
In this article, the authors present the adaptation and validation processes conducted to render the American "Test of Financial Literacy" (TFL) suitable for use in Germany (TFL-G). First, they outline the translation procedure followed and the various cultural adjustments made in line with international standards. Next, they present…
Descriptors: Money Management, Tests, Scores, Test Content
Guyon, Hervé; Tensaout, Mouloud – Measurement: Interdisciplinary Research and Perspectives, 2016
In this article, the authors extend the results of Aguirre-Urreta, Rönkkö, and Marakas (2016) concerning the omission of a relevant causal indicator by testing the validity of the assumption that causal indicators are entirely superfluous to the measurement model and discuss the implications for measurement theory. Contrary to common wisdom…
Descriptors: Causal Models, Structural Equation Models, Formative Evaluation, Measurement
Bowden, Stephen C. – Journal of Psychoeducational Assessment, 2013
In surveying the literature on assessment of cognitive abilities in adults and children, it is easy to assume that the proliferation of test batteries and terminology reflects a poverty of unifying models. However, the lack of recognition accorded good models of cognitive abilities may reflect inattention to theoretical development and injudicious…
Descriptors: Intelligence Tests, Intelligence, Adults, Children
Walstad, William B.; Rebeck, Ken – Journal of Economic Education, 2017
The "Test of Financial Literacy" (TFL) was created to measure the financial knowledge of high school students. Its content is based on the standards and benchmarks stated in the "National Standards for Financial Literacy" (Council for Economic Education 2013). The test development process involved extensive item writing and…
Descriptors: Tests, Money Management, Literacy, High School Students
Reisman, Fredricka; Keiser, Larry; Otti, Obinna – Creativity Research Journal, 2016
The Reisman Diagnostic Creativity Assessment (RDCA) is a free online self-report creativity assessment that provides immediate feedback to the user and is diagnostic, rather than predictive, with the focus on making the user aware of creative strengths and weaknesses. Several engineering and teacher education studies have included the RDCA over a…
Descriptors: Creativity, Creativity Tests, Creative Development, Computer Oriented Programs
Karami, Hossein – TESOL Journal, 2015
Factor analysis has been frequently exploited in applied research to provide evidence about the underlying factors in various measurement instruments. A close inspection of a large number of studies published in leading applied linguistic journals shows that there is a misconception among applied linguists as to the relative merits of exploratory…
Descriptors: Factor Analysis, Construct Validity, Applied Linguistics, Computer Software
Kourea, Lefki; Lo, Ya-yu – International Journal of Research & Method in Education, 2016
Improving academic, behavioural, and social outcomes of students through empirical research has been a firm commitment among researchers, policy-makers, and other professionals in education across Europe and the United States (U.S.). To assist in building scientific evidences, executive bodies such as the European Commission and the Institute for…
Descriptors: Evidence Based Practice, Validity, Randomized Controlled Trials, Research Methodology
Reilly, Erin Dawna; Stafford, Rose Eleanore; Williams, Kyle Marie; Corliss, Stephanie Brooks – International Review of Research in Open and Distance Learning, 2014
The use of massive open online courses (MOOCs) to expand students' access to higher education has raised questions regarding the extent to which this course model can provide and assess authentic, higher level student learning. In response to this need, MOOC platforms have begun utilizing automated essay scoring (AES) systems that allow…
Descriptors: Online Courses, Essays, Scoring, Automation