Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 41 |
Since 2006 (last 20 years) | 76 |
Descriptor
Statistical Analysis | 192 |
Test Validity | 192 |
Test Reliability | 95 |
Testing | 61 |
Test Construction | 60 |
Foreign Countries | 46 |
Correlation | 40 |
Factor Analysis | 37 |
Scores | 36 |
Testing Problems | 35 |
Computer Assisted Testing | 33 |
More ▼ |
Source
Author
Gleser, Leon Jay | 2 |
Hambleton, Ronald K. | 2 |
He, Lianzhen | 2 |
Hurley, Christine | 2 |
Liu, Ou Lydia | 2 |
Livingston, Samuel A. | 2 |
Rios, Joseph A. | 2 |
Spicuzza, Richard | 2 |
Swarthout, David | 2 |
Thurlow, Martha | 2 |
ANDRADE, MANUEL | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 4 |
Location
Australia | 4 |
California | 4 |
China | 3 |
Iran | 3 |
Japan | 3 |
Netherlands | 3 |
Turkey | 3 |
Germany | 2 |
Israel | 2 |
Malaysia | 2 |
Minnesota | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Rios, Joseph A.; Liu, Ou Lydia – American Journal of Distance Education, 2017
Online higher education institutions are presented with the concern of how to obtain valid results when administering student learning outcomes (SLO) assessments remotely. Traditionally, there has been a great reliance on unproctored Internet test administration (UIT) due to increased flexibility and reduced costs; however, a number of validity…
Descriptors: Online Courses, Testing, Test Wiseness, Academic Achievement
Öz, Hüseyin; Özturan, Tuba – Journal of Language and Linguistic Studies, 2018
This article reports the findings of a study that sought to investigate whether computer-based vs. paper-based test-delivery mode has an impact on the reliability and validity of an achievement test for a pedagogical content knowledge course in an English teacher education program. A total of 97 university students enrolled in the English as a…
Descriptors: Computer Assisted Testing, Testing, Test Format, Teaching Methods
He, Lianzhen; Min, Shangchao – Language Assessment Quarterly, 2017
The first aim of this study was to develop a computer adaptive EFL test (CALT) that assesses test takers' listening and reading proficiency in English with dichotomous items and polytomous testlets. We reported in detail on the development of the CALT, including item banking, determination of suitable item response theory (IRT) models for item…
Descriptors: Computer Assisted Testing, Adaptive Testing, English (Second Language), Second Language Learning
Cohen, Yoav; Levi, Effi; Ben-Simon, Anat – Applied Measurement in Education, 2018
In the current study, two pools of 250 essays, all written as a response to the same prompt, were rated by two groups of raters (14 or 15 raters per group), thereby providing an approximation to the essay's true score. An automated essay scoring (AES) system was trained on the datasets and then scored the essays using a cross-validation scheme. By…
Descriptors: Test Validity, Automation, Scoring, Computer Assisted Testing
Pujayanto, Pujayanto; Budiharti, Rini; Adhitama, Egy; Nuraini, Niken Rizky Amalia; Putri, Hanung Vernanda – Physics Education, 2018
This research proposes the development of a web-based assessment system to identify students' misconception. The system, named WAS (web-based assessment system), can identify students' misconception profile on linear kinematics automatically after the student has finished the test. The test instrument was developed and validated. Items were…
Descriptors: Misconceptions, Physics, Science Instruction, Databases
Zimmerman, Whitney Alicia; Kang, Hyun Bin; Kim, Kyung; Gao, Mengzhao; Johnson, Glenn; Clariana, Roy; Zhang, Fan – Journal of Statistics Education, 2018
Over two semesters short essay prompts were developed for use with the Graphical Interface for Knowledge Structure (GIKS), an automated essay scoring system. Participants were students in an undergraduate-level online introductory statistics course. The GIKS compares students' writing samples with an expert's to produce keyword occurrence and…
Descriptors: Undergraduate Students, Introductory Courses, Statistics, Computer Assisted Testing
García-Santillán, Arturo; Martínez-Rodríguez, Valeria; Santana, Josefina C. – European Journal of Contemporary Education, 2018
The purpose of this study was to determine if there is a structure of variables that allows us to understand the level of Anxiety towards Mathematics in high school students from the municipalities of Zacatal and Jamapa, Veracruz, Mexico. This was based on the seminal works of Richardson and Suinn [1972], who developed the Mathematics Anxiety…
Descriptors: Foreign Countries, High School Students, Anxiety, Mathematics Anxiety
Yu, Guoxing; He, Lianzhen; Rea-Dickins, Pauline; Kiely, Richard; Lu, Yanbin; Zhang, Jing; Zhang, Yan; Xu, Shasha; Fang, Lin – ETS Research Report Series, 2017
Language test preparation has often been studied within the consequential validity framework in relation to ethics, equity, fairness, and washback of assessment. The use of independent and integrated speaking tasks in the "TOEFL iBT"® test represents a significant development and innovation in assessing speaking ability in academic…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language
Bergsmann, Evelyn; Klug, Julia; Burger, Christoph; Först, Nora; Spiel, Christiane – Assessment & Evaluation in Higher Education, 2018
There is a lively discussion on how to evaluate competence-based higher education in both evaluation and competence research. The instruments used are often limited to course evaluation or specific competences, taking a rather narrow perspective. Furthermore, the instruments often comprise predetermined competences that cannot be adapted to higher…
Descriptors: Questionnaires, Minimum Competency Testing, Screening Tests, Higher Education
Gross-Spector, Michal; Cinamon, Rachel Gali – Journal of Career Development, 2018
To promote our theoretical understanding regarding the exploration process during adulthood, the current study focusses on this process as it relates to work and family life roles and the relations between them, during the transition to motherhood. Two instruments assessing vocational and maternal exploration, relating to self and environment…
Descriptors: Adults, Career Exploration, Career Development, Family Work Relationship
Kato, Daiki; Suzuki, Mikie – Education, 2018
The main purpose of this study is to hypothesize the existence of the general sense of role satisfaction and device a scale to measure it. The participants in study 1 were 1029 Japanese high school students (484 men and 545 women). The result of exploratory factor analysis suggested that the two-factor structure is adequate. One is "social…
Descriptors: Rating Scales, Test Construction, Psychometrics, Hypothesis Testing
Bayazidi, Aso; Saeb, Fateme – Advances in Language and Literary Studies, 2017
This study examined the equivalence and reliability of the two versions of the Vocabulary Levels Test in an Iranian context. This study was motivated by the fact that the Vocabulary Levels test is increasingly being used in Iran for both research and pedagogical purposes without having been checked for validity and reliability in this context. The…
Descriptors: Foreign Countries, Vocabulary, English (Second Language), College Second Language Programs
Timpe-Laughlin, Veronika; Choi, Ikkyu – Language Assessment Quarterly, 2017
Pragmatics has been a key component of language competence frameworks. While the majority of second/foreign language (L2) pragmatics tests have targeted productive skills, the assessment of receptive pragmatic skills remains a developing field. This study explores validation evidence for a test of receptive L2 pragmatic ability called the American…
Descriptors: Pragmatics, Language Tests, Test Validity, Receptive Language
Gehsmann, Kristin; Spichtig, Alexandra; Tousley, Elias – Literacy Research: Theory, Method, and Practice, 2017
Assessments of developmental spelling, also called spelling inventories, are commonly used to understand students' orthographic knowledge (i.e., knowledge of how written words work) and to determine their stages of spelling and reading development. The information generated by these assessments is used to inform teachers' grouping practices and…
Descriptors: Spelling, Computer Assisted Testing, Grouping (Instructional Purposes), Teaching Methods