Buscar

Validação de método in vitro para irritação cutânea

Prévia do material em texto

Regulatory Toxicology and Pharmacology 117 (2020) 104725
Available online 5 August 2020
0273-2300/© 2020 Elsevier Inc. All rights reserved.
Me-too validation study for in vitro skin irritation test with a reconstructed 
human epidermis model, KeraSkin™ for OECD test guideline 439 
Juhee Han a, Seolyeong Kim b, Su-Hyun Lee b, Jin-Sik Kim c, Yu Jin Chang d, Tae-Cheon Jeong e, 
Mi-Jeong Kang e, Tae-Sung Kim f, Hae Seong Yoon f, Ga Young Lee f, SeungJin Bae a,**, 
Kyung-Min Lim a,* 
a College of Pharmacy, Ewha Womans University, Seoul, Republic of Korea 
b Biosolution Co, Seoul, Republic of Korea 
c COSMAX Korea, Seongnam-si, Republic of Korea 
d Korea Conformity Laboratories, Incheon, Republic of Korea 
e College of Pharmacy, Yeungnam University, Gyeongsan, Republic of Korea 
f National Institute of Food and Drug Safety Evaluation, Ministry of Food and Drug Safety, Osong, Republic of Korea 
A R T I C L E I N F O 
Keywords: 
KeraSkinTM 
Reconstructed human epidermis 
OECD TG 439 
Reproducibility 
Predictive capacity 
Skin irritation 
A B S T R A C T 
We conducted a me-too validation study to confirm the reproducibility, reliability, and predictive capacity of 
KeraSkin™ skin irritation test (SIT) as a me-too method of OECD TG 439. With 20 reference chemicals, within- 
laboratory reproducibility (WLR) of KeraSkin™ SIT in the decision of irritant or non-irritant was 100%, 100%, 
and 95% while between-laboratory reproducibility (BLR) was 100%, which met the criteria of performance 
standard (PS, WLR≥90%, BLR≥80%). WLR and BLR were further confirmed with intra-class correlation (ICC, 
coefficients >0.950). WLR and BLR in raw data (viability) were also shown with a scatter plot and Bland-Altman 
plot. Comparison with existing VRMs with Bland-Altman plot, ICC and kappa statistics confirmed the compat-
ibility of KeraSkin™ SIT with OECD TG 439. The predictive capacity of KeraSkin™ SIT was estimated with 20 
reference chemicals (the sensitivity of 98.9%, the specificity of 70%, and the accuracy of 84.4%) and additional 
46 chemicals (for 66 chemicals [20 + 46 chemicals, the sensitivity, specificity and accuracy: 95.2%, 82.2% and 
86.4%]). The receiver operating characteristic (ROC) analysis suggested a potential improvement of the pre-
dictive capacity, especially sensitivity, when changing cut-off (50% → 60–75%). Collectively, the me-too vali-
dation study demonstrated that KeraSkin™ SIT can be a new me-too method for OECD TG 439. 
1. Introduction 
The European Commission has prohibited animal experiments for 
cosmetics in 2013 and has been working on developing alternative test 
methods to replace animal testing based on the ‘3R’s Principle’ (Adler 
et al., 2011). Since the EU ban on animal testing for cosmetics, more 
than 37 countries worldwide have legally prohibited animal experi-
ments for the development of cosmetics (Akbarsha and Mascarenhas, 
2019). In addition, there is an increasing demand for new approach 
methodologies (NAMs) to evaluate the safety of numerous chemicals on 
human health in various sectors (Parish et al., 2020). In vitro skin irri-
tation test, OECD TG 439, has been developed to replace OECD TG 404 
(OECD, 2002) in which rabbits are used as a test species to evaluate the 
skin irritancy of chemicals and cosmetic products (Kose et al., 2018; 
Park et al., 2018). OECD TG 439 uses a reconstructed human epidermis 
(RhE) (OECD, 2015a) and can be used as a stand-alone to identify UN 
GHS No category chemical, i.e., non-irritant or in combination with 
other replacement methods, such as OECD TG 435 “In Vitro Membrane 
Barrier Test Method for Skin Corrosion” (OECD, 2015b) to further 
classify the hazard on skin in the framework of Integrated Approach on 
Abbreviations: OECD, The Organisation for Economic Co-operation and Development; TG, Test guideline; PS, Performance Standards; VRM, Validated Reference 
Method; SIT, skin irritation test; WLR, Within-laboratory reproducibility; BLR, Between-laboratory reproducibility; RhE, Reconstructed human Epidermis; CV, cell 
viability; I, irritant; NI, non-irritant; GHS, Globally Harmonized System of Classification and Labelling of Chemicals. 
* Corresponding author. 
** Corresponding author. 
E-mail addresses: sjbae@ewha.ac.kr (S. Bae), kmlim@ewha.ac.kr (K.-M. Lim). 
Contents lists available at ScienceDirect 
Regulatory Toxicology and Pharmacology 
journal homepage: www.elsevier.com/locate/yrtph 
https://doi.org/10.1016/j.yrtph.2020.104725 
Received 4 March 2020; Received in revised form 25 June 2020; Accepted 2 July 2020 
mailto:sjbae@ewha.ac.kr
mailto:kmlim@ewha.ac.kr
www.sciencedirect.com/science/journal/02732300
https://www.elsevier.com/locate/yrtph
https://doi.org/10.1016/j.yrtph.2020.104725
https://doi.org/10.1016/j.yrtph.2020.104725
https://doi.org/10.1016/j.yrtph.2020.104725
http://crossmark.crossref.org/dialog/?doi=10.1016/j.yrtph.2020.104725&domain=pdf
Regulatory Toxicology and Pharmacology 117 (2020) 104725
2
Testing and Assessment (IATA) for Skin Corrosion and Irritation (OECD, 
2017). 
Performance standards (PS) of OECD TG 439 for the assessment of 
similar and modified RhE-based test methods, OECD guidance docu-
ment 220 (OECD, 2015b) was developed to encourage the introduction 
of new ‘‘me-too’’ RhE models and methods. At the time of original 
adoption of OECD TG 439 (OECD, 2010), EpiDerm™ SIT and EpiSkin™ 
SIT were only two validated reference methods (VRMs). Since then, 
various me-too skin irritation tests like SkinEthic™ RhE SIT and Labcyte 
EPI-MODEL24 SIT were developed, validated and approved for OECD 
TG 439 in 2013. In 2019, OECD TG 439 was updated with two additional 
me-too test methods, epiCS® and Skin+® (OECD, 2019a; 2019b). 
KeraSkin™ is a newly developed RhE using human primary kerati-
nocytes to evaluate skin irritation of chemicals (Jung et al., 2014). 
Several studies have demonstrated that KeraSkin™ can be applied to the 
skin toxicity in various settings (Choi et al., 2014; Hwang et al., 2018; 
Jung et al., 2014; Kim et al., 2017; Lee et al., 2016, 2017). Here, we 
conducted a me-too validation test to confirm the reproducibility, and 
predictive capacity of KeraSkin™ SIT as a me-too method’ of OECD TG 
439 in accordance of OECD TG 439 PS with 20 reference chemicals. 
Additional 46 chemicals were tested to further evaluate the predictive 
capacity of KeraSkin™ SIT. 
In addition to the evaluation of WLR, BLR and predictive capacity 
based on proportion of concordance of decisions, various statistical 
methods have been used to assess the WLR and BLR, and compatibility 
to existing VRMs (Alépée et al., 2016; Lim et al., 2019a, 2019b). 
Intra-class Correlation Coefficient (ICC) is a commonly used indicator to 
assess reproducibility and is an estimate of the fraction of the total 
measurement variability caused by individual appreciator variation 
(Szklo and Nieto, 2014). It was also used in the analysis of WLR and BLR 
(Hirschmann et al., 2011). ICC values less than 0.5 are indicative of poor 
reliability, values between 0.5 and 0.75 indicate moderate reliability, 
values between 0.75 and 0.9 indicate good reliability, and values greater 
than 0.90 indicate excellent reliability (Koo and Li, 2016). 
The Bland-Altman plot is also recommended for comparative studies 
by showing the relationship between the mean and the difference in 
values obtained from each test result. It is very useful in examining 
patterns of inconsistencies between measurements (Atkinson, 1998). 
Bland-Altman plot shows three horizontal lines parallel to the x-axis, 
representing the mean differences between the measured values, and 
±95% limit of agreement. Bland-Altman plot was recommended that 
95% of the data points should lie within ±95% limit of agreement 
(Giavarina, 2015). In addition, Cohen’s kappa statistics is often used to 
measure the concordance between two observers for normal scales 
(Berry et al., 1988; Blackman, 2000). Kappa statistics can vary from − 1 
to 1 (Blackman, 2000; Landisand Koch, 1977). The kappa value less 
than 0.00 indicates poor, 0.00–0.20, slight, 0.21–0.40, fair and 
0.41–0.60, moderate, 0.61–0.80, substantial and 0.81–1.00, almost 
perfect concordance (Landis and Koch, 1977). 
For the assessment of optimal cutoffs, receiver operating character-
istic (ROC) curve analysis is an effective method to assess quality or 
performance (Park et al., 2004; Yang et al., 2017). In addition, to pro-
vide 95% confidence interval for predictive capacity with a small sample 
number, Wilson’s score confidence interval was used (Alépée et al., 
2016). With the various statistical analysis, we could demonstrated that 
KeraSkin™ SIT exhibits good reproducibility and predictive capacity 
that can meet the criteria of OECD TG 439 PS, which could contribute to 
reducing animal tests by providing with another new method to replace 
the rabbit skin irritation test. 
2. Material and methods 
2.1. Chemicals and reagents 
The 20 reference chemicals as suggested in the OECD TG 439 per-
formance standards were used as described in Table 1. 20 chemicals 
consisted of 10 non-irritants (top 10 in Table 1) and 10 irritants (bottom 
10 in Table 1) according to the UN GHS category, of which 4 were solids 
and 16, liquids. Also, 46 chemicals (19 solids and 27 liquids) were 
selected from previous validation study reports and papers to further 
Table 1 
20 reference chemicals recommended from GD 220 published in 2015. 
No. Chemicals CASN UN GHS Cat. Physical 
Status 
Storage 
condition 
Manufacturer Cat No. Purity 
(%) 
Non-classified 
chemicals 
1 1-Bromo-4-chlorobutane 6940-78- 
9 
No Cat. liquid RT Sigma B60800 99.0 
2 Diethyl phthalate 84-66-2 No Cat. liquid RT Sigma 524972 99.9 
3 Naphthalene acetic acid 86-87-3 No Cat. solid RT Sigma N0640 97.0 
4 Allyl phenoxy-acetate 7493-74- 
5 
No Cat. liquid RT Sigma W203807 99.8 
5 Isopropanol 67-63-0 No Cat. liquid RT Sigma W292907 100.0 
6 4-Methyl-thio-benzaldehyde 3446-89- 
7 
No Cat. liquid RT Sigma 222771 98.9 
7 Methyl stearate 112-61-8 No Cat. solid RT Sigma 85769 99.2 
8 Heptyl butyrate 5870-93- 
9 
No Cat. (Optional 
Cat. 3) 
liquid RT Sigma W254908 99.4 
9 Hexyl salicylate 6259-76- 
3 
No Cat. (Optional 
Cat. 3) 
liquid RT Sigma W520306 99.5 
10 Cinnamaldehyde 104-55-2 No Cat. (Optional 
Cat. 3) 
liquid RT Sigma W228613 99.1 
Classified 
chemicals 
11 1-Decanol 112-30-1 Cat. 2 liquid RT Sigma 239763 99.1 
12 Cyclamen aldehyde 103-95-7 Cat. 2 liquid RT TCI I0377 97.8 
13 1-bromohexane 111-25-1 Cat. 2 liquid RT Sigma B68240 99.3 
14 2-Chloromethyl-3,5-dimethyl-4- 
methoxypyridine HCL 
86604- 
75-3 
Cat. 2 solid RT Sigma 535508 ≥99.9 
15 Di-n-propyl disulphide 629-19-6 Cat. 2 liquid RT Sigma 149225 97.8 
16 Potassium hydroxide(5% aq.) 1310-58- 
3 
Cat. 2 liquid RT Sigma 417661 – 
17 Benzenethiol, 5-(1,1-dimethylethyl)- 
2-methyl 
7340-90- 
1 
Cat. 2 liquid RT ACROS 279960500 93.9 
18 1-Methyl-3-phenyl-1-piperazine 5271-27- 
2 
Cat. 2 solid RT AK scientific A083 99.0 
19 Heptanal 111-71-7 Cat. 2 liquid RT Sigma W254002 97.7 
20 Tetrachloroethylene 127-18-4 Cat. 2 liquid RT Sigma 371696 100.0 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
3
assess predictive capacity. In total, 66 chemicals composed of 23 solids 
and 43 liquids (21 irritants and 45 non-irritants) were tested in further 
evaluating the predictive capacity of KeraSkin™ SIT. Participating lab-
oratories (Biosolution Co, Seoul, Korea, COSMAX Korea, Seongnam-si, 
Korea and Korea Conformity Laboratories, Incheon, Korea) tested all 
the substances under the blind conditions. 
2.2. KeraSkin™ SIT protocol 1.5 
Overall procedure of KeraSkin™ SIT was described in Fig. 1. Original 
protocol of KeraSkin™ SIT was developed in 2012, which was estab-
lished through adopting the existing VRMs of OECD TG 439. Five times 
of revisions were made to optimize KeraSkin™ SIT protocol and to 
correct mis-predictions for some PS chemicals with respect to washing 
procedure (employment of squeeze bottle and cotton-swab), removal of 
mesh application, treatment volume (from 30 to 40 μL) and treatment 
time (from 40 min to 30 min). 
Viscous substance sometimes made the mesh stick to the tissue even 
after the washing process, so the mesh was removed. Instead the treat-
ment volume was increased (30 μL→40 μL) to evenly distribute the 
substance on the tissue and the treating time (45 min→30 min) was 
shortened to compensate the increased treatment volume. Post- 
incubation time of 42 h is the same as the existing VRMs. 
Briefly, the KeraSkin™ model, which passed the quality control 
[optical density values of negative control (NC) by MTT (3-(4,5-dime-
thylthiazol-2-yl)-2,5-diphenyltetrazolium bromide) assay: 0.5–1.2, 
ET50: 6.9.0–14.0 h, IC50 for sodium dodecyl sulfate (SDS) 1.5–4.7 mg/ 
mL and histological examination] of the manufacturer (Biosolution Co. 
Seoul, Korea) was delivered to the testing labs in a 24-well format on an 
agarose gel at about 25 ◦C. Actual QC data for the 13 batches used for the 
main study was NC OD: 0.9 ± 0.1, ET50: 10.7 ± 1.5, IC50: 3.3 ± 0.4 
(Mean ± SD). It was confirmed through an oversea shipment that Ker-
aSkin™ was stable up to 56 h of delivery as determined by quality 
control criteria of NC OD and IC50 for SDS (data not shown). Upon 
receipt of the shipment, culture medium provided by the manufacturer 
was pre-warmed in a 37 ◦C thermostat for 30 min. As for the pre- 
incubation step, 900 μL of the pre-warmed medium was added to each 
well of a 6-well plate using micropipette and the RhE model insert was 
carefully transferred to a well using forceps. Then the well-plate was pre- 
incubated at 37 ◦C, 5% CO2 for 22 ± 2 h. To conduct the experiment, 40 
Fig. 1. KeraSkinTM SIT protocol 1.5. Overall KeraSkin™ skin irritation test (SIT) procedure. CV, cell viability, NI, non-irritant. 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
4
μL of liquid substance or solution, or 40 mg (with 40 μL DPBS) of solid 
substance was topically applied on the upper epithelial surface of the 
model insert (0.6 cm2) after the pre-incubation. The tissue was incu-
bated again (37 ◦C, 5% CO2 condition) for 30 ± 1 min (Fig. 2). Then the 
tissue was washed to remove the test substances and further incubated 
(37 ◦C, 5% CO2 condition) for 42 ± 2 h. The resulting tissue viability was 
determined by MTT assay. 
2.2.1. Washing 
Washing procedure for the 6-wells should be done within 20 min. 
DPBS for washing should be kept warm in a 37 ◦C thermostat before use. 
The treated tissue was removed from a 37 ◦C, 5% CO2 incubator and 
DPBS was flowed on the side wall of the insert using a poly wash bottle, 
and when DPBS was fully filled in the insert, DBPS in the insert was 
poured to a beaker at once. Washing was repeated 10 times in total. 
However, if residual material (i.e. oils or highly viscous liquid materials, 
and solid materials with gelation) was still observed after 5 times of 
wash, a cotton swab was used to wipe off the residual materials. After 
washing, DPBS outside of the insert were removed using a sterile gauze. 
2.2.2. MTT assay 
After post-incubation, the media inside and outside of inserts were 
removed with a pipette. Inserts were transferred into 24-well plates, 
prefilled with 200 μL of 0.4 mg/mL MTT per a well. Then, 100 μL of MTT 
solution was further added to the inside of insert. Then the plate was 
covered with aluminum foil to protect from the light and placed in an 
incubator (37 ◦C, 5% CO2) for 3 h ± 5 min. Afterincubation, MTT so-
lution remaining was removed from the inside/outside of the insert 
using 200 μL pipette and the tissues were transferred to a new 6-well 
plate containing 1.9 mL/well of isopropanol. Then additional 100 μL 
of isopropanol was added to the inside of the insert. Plate was covered 
from the light using aluminum foil and shaken on a plate shaker for 3 h 
± 5 min to extract formazan. After formazan extraction, the extracts 
inside and outside of the insert were collected and mixed well by 
pipetting such that formazan crystals were dissolved clearly. 250 μL of 
extract per a well was transferred into a 96 well plate, for the optical 
density (OD) measurement at 570 nm. 
2.2.3. Detecting and correcting interference from colored chemicals and 
MTT reducers 
In order to prevent the possible interference with the MTT reading by 
colored chemicals, i.e., ‘direct staining’ or ‘MTT reacting’ chemicals, 
were pre-checked. Though, when a test substance is determined to be an 
irritant, the correction shall be not necessary since correction always 
over-predict the classification of skin irritancy by subtracting the cell 
viability. Accordingly, the color interference corrections may be 
reserved as optional for the test chemicals with significant concerns. 
Overall scheme for correcting color interference could be advised as 
described in Fig. 2. 
Colored test substance may interfere with OD measurement of MTT 
formazan by directly staining the tissue. 0.5 mL of deionized water and 
40 μL (liquid) or 40 mg (solid) of the test chemical were added to each 
Eppendorf tube and the mixture was incubated in a cell incubator (37 ±
1 ◦C, 5 ± 1% CO2, 95% RH) for 30 min. The color changes were 
observed, and if there was a significant color change, correction for color 
interference with the viability value calculated from OD of the staining 
control [viability value of a tissue incubated with phenol red free-DMEM 
instead of MTT solution (%NSCliving)] was necessary. 
Fig. 2. Color interference post-checking scheme for KeraSkin™ SIT. CV, cell viability, %TT, Uncorrected viability value, %NWliving, viability value of a tissue 
incubated with phenol red free-DMEM instead of MTT solution, %NSMTT, viability value of reactivity control obtained with OD from the test done with freeze-killed 
tissues, %NSCKilled, viability value of freeze-killed tissue incubated with phenol red free-DMEM instead of MTT solution. 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
5
Table 2 
Within- and between-laboratory reproducibility of the Keraskin™ SIT test results. 
Fig. 3. Scatter plot of 3 runs for 20 PS reference chemicals by 3 laboratories. Cell viability data from the KeraSkin™ SIT. Scatter plot with the mean± ​ SD (error bar) 
of 3 runs for each lab. Biosolution (BS), KCL and COSMAX. 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
6
A test substance with MTT reacting activity may reduce MTT directly 
and form formazan alone. 0.5 mL of MTT solution (0.4 mg/mL) and 40 
μL (liquid) or 40 mg (solid) of the test chemical were added to an 
Eppendorf tube and the mixture was incubated in a cell incubator (37 ◦C, 
5% CO2, 95% RH) for 1 h. The color change was observed and if the 
color of MTT solution changes to blue/purple, correction for color 
interference with the viability value of reactivity control obtained with 
OD from the test done with freeze-killed tissues (%NSMTT) was 
necessary. 
If a test substance was both direct-staining and MTT-reacting, then 
the correction needs to be done for both direct-staining and MTT reac-
tivity, but the color interference would be subtracted twice. To resolve 
this, another staining control with freeze-killed tissues [viability value of 
freeze-killed tissue incubated with phenol red free-DMEM instead of 
MTT solution (%NSCKilled)] is necessary. 
2.2.4. Prediction model for the KeraSkin™ SIT 
According to the EU and the GHS classification, an irritant is pre-
dicted if the mean relative tissue viability of minimum three replicate 
tissues exposed to the test chemical is reduced to 50% or below of the 
mean viability of the negative controls. The test chemical was defined as 
a non-irritant if the tissue viability was higher or equal to 50%. Other-
wise, it was determined as an irritant. Since 50% cut-off value is adopted 
by all existing VRMs of OECD TG 439, KeraSkin™ SIT protocol has been 
optimized to 50% cut-off. 
2.2.5. Acceptance criteria 
A test is regarded as acceptable unless the results do not fall into the 
criteria for a re-test, which is adopted for KeraSkin™ SIT from OECD TG 
439, 
1) If absorbance value of the negative control chemical is less than 0.7 
or exceeds 1.6. 
2) If a cell viability value of the positive control chemical is higher than 
40%. 
3) In negative control chemical, positive control chemical, and test 
chemical: When the standard deviation of the cell viability value of 
treated tissue in replications exceeds 18%. 
4) If the average cell survival rate of the chemical-treated-well is be-
tween 45% and 55% (≥45% and ≤55%, borderline chemicals). 
If the re-test gives borderline value again, the concordance of the 
decision should be considered. If the original and re-test are concordant 
in decision, additional re-test is not necessary. If the decision is different, 
then the third test should be conducted and majority vote approach 
should be taken for final decision. Acceptance criteria for KeraSkin™ SIT 
were established referring to existing VRMs of OECD TG 439. 
2.3. Statistical analysis for the reliability of the KeraSkin™ SIT 
2.3.1. Data collection and chart 
Cell viability data were obtained for 20 reference chemicals from a 
total of three repetitions of KeraSkin™ SIT by three laboratories (total 9 
runs of data for a chemical were obtained). 46 additional chemicals were 
tested by two laboratories, Biosolution and KCL (One or two runs of data 
for a chemical was obtained). All participating laboratories performed 
the tests and quality assurance in the spirit of Good Laboratory Practice 
(GLP). The cell viability was shown in the table as mean± ​ SD. Also, the 
results were plotted on a scatter plot to show the data distribution at a 
glance, with the cut-off line to identify incorrect decisions. 
2.3.2. Within-laboratory and between-laboratory reproducibility 
Within-laboratory reproducibility (WLR) is an evaluation of the 
consistency of the results of three replicate tests conducted by a labo-
ratory and between-laboratory reproducibility (BLR) is an evaluation of 
the consistency between the three laboratories using the mean value of 
results from different independent laboratory tests of the reference 
chemical. WLR and BLR for 20 chemicals were presented as a 
percentage. 
WLR and BLR were also expressed in confidence intervals using 
Wilson’s score confidence interval method to consider the uncertainty in 
the value of the point estimate presented. ICC and Bland-Altman 
methods were also used for the reproducibility analysis. The formula 
for the ICC is, 
ICC=
Vb
VT
=
Vb
Vb + Ve 
in which Vb = variance between individuals, VT = total variance, which 
includes both Vb and Ve [Ve = unwanted variance (“error”)] (Szklo and 
Nieto, 2014). There are two types of ICC: Single measures ICC is an index 
for the reliability of the ratings for one, typical, single rater while 
average measures ICC is of different raters averaged together. (Shrout 
and Fleiss, 1979) The average measures ICC is always higherthan the 
single measures ICC. (Bartko, 1976) 
2.3.3. Correlation with validated reference methods 
In order to determine the compatibility of KeraSkin™ SIT with 
validated reference methods (VRMs) of OECD TG 439; EpiDerm™ SIT, 
EpiSkin™, and LabCyte EPI-MODEL24 SIT, ICC analysis, Bland-Altman 
plot and Cohen’s kappa analysis were used to evaluate the agreement. 
2.4. Predictive capacity of the KeraSkin™ SIT 
2.4.1. 20 chemicals and 46 additional chemicals for the evaluation of 
predictive capacity 
The mis-predicted test/total was used to determine the predictive 
capacity (Alépée et al., 2016) where in all test results of each laboratory 
were used. The judgment are classified as irritant (I) and non-irritant 
(NI) and concordance of the reference values and test results was 
shown in 2 × 2 contingency table. The result of 2 × 2 contingency table 
judgment was calculated with various predictive power indicators 
(Cooper and Saracci, 1979). Predictive capacity was calculated as an 
index of sensitivity, specificity, and accuracy. The results were presented 
as point estimates and 95% confidence intervals were calculated using 
Wilson’s score confidence interval. According to the OECD TG 439 
Performance Standards, the sensitivity, specificity, and accuracy shall be 
80%, 70%, and 75%, respectively or higher. 
The ROC curve was used to evaluate the predictive capacity of Ker-
aSkin™ SIT. The additional predictive capacity assessment consisted of 
Table 3 
Intra-class correlation coefficient of the test results of 3 repetitions in 3 labs. 
Intra-class 
correlation 
coefficient (ICC) 
95% confidence 
interval 
Lower 
bound 
Upper 
bound 
WLR BS Single 
measures 
.971 .940 .987 
Average 
measures 
.990 .979 .996 
KCL Single 
measures 
.976 .949 .989 
Average 
measures 
.992 .983 .996 
COSMAX Single 
measures 
.957 .912 .981 
Average 
measures 
.985 .969 .994 
BLR Single 
measures 
.961 .921 .983 
Average 
measures 
.987 .972 .994 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
7
Fig. 4. Bland-Altman plots for evaluating the BLR between two laboratories. Bland-Altman plots for a) BS and KCL, b) BS and COSMAX, c) KCL-COSMAX. Dotted 
lines indicate 95% confidence limits. Blue circle represents liquid chemicals and red circle, solids. (For interpretation of the references to color in this figure legend, 
the reader is referred to the Web version of this article.) 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
8
66 chemicals with 46 chemicals added to the existing 20 chemicals. 66 
chemicals have unequal numbers of test results. To correct this, a 
weighted approach analysis was also performed for the analysis of 
predictive capacity. 
3. Results 
3.1. Within-and between-laboratory reproducibility 
To investigate whether KeraSkin™ SIT met the criteria of “me-too” 
test method evaluation stated in the performance standards (PS) for 
OECD TG 439, 20 reference chemicals were subject to three repeated 
tests by three laboratories (Table 2 and Fig. 3). Overall decisions were 
mostly consistent within and between laboratories except for Di-n- 
propyl disulphide for which Lab 3 incorrectly predicted once. Within- 
laboratory reproducibility (WLR) and between-laboratory reproduc-
ibility (BLR) in cell viability and decisions were also presented with a 
scatter plot which showed small scatters of viability data for each 
chemical except for Di-n-propyl disulphide (No. 5). In terms of concor-
dance in decision of “non-irritant” or “irritant”, Lab 1 and Lab 2 showed 
100% (20/20) and Lab 3, 95% (19/20) WLRs while BLR of three labo-
ratories was 100% (20/20). These results met the criteria of WLR ≥ 90% 
and BLR ≥ 80% of OECD TG 439 PS. 
ICC was also calculated to evaluate WLR and BLR of cell viability 
results of KeraSkin™ SIT. Both the single measures and average mea-
sures ICC of WLR for BS, KCL and COSMAX, and BLR were all above 0.9 
(Table 3), suggesting an excellent level of WLR and BLR of KeraSkin™ 
SIT. Also, Bland-Altman plot were drawn to evaluate BLR as shown in 
Fig. 4, which suggests that most of values fell within 95% agreement 
limit, and the difference between the measured values was close to zero, 
regardless of the physical states. Interestingly it could be seen that mean 
CV values aggregated around 0 or 100%, reflecting an excellent iden-
tification of irritants from non-irritants. 
3.2. Correlation of KeraSkin™ SIT with validated reference methods of 
OECD TG 439 
KeraSkin™ SIT was compared with other existing validation refer-
ence methods (VRMs) of OECD TG 439 using ICC, Bland-Altman plot 
and Kappa statistics. As shown in Table 4, the average ICC between 
KeraSkin™ SIT and Episkin™ SIT was 0.972 (95% CI: 0.929–0.989), 
EpiDerm™ SIT, 0.966 (95% CI: 0.914–0.986), and LabCyte EPI- 
MODEL24 SIT, 0.993 (95% CI: 0.983–0.997), suggesting an excellent 
correlation. Also, the Bland-Altman plot was distributed close to zero, as 
shown in Fig. 5. There was no difference between the liquids and the 
solids but viability values of KeraSkin™ SIT tended to be lower than 
VRMs. Especially, 4-methyl-thio-benzaldehyde and Di-n-propyl disul-
phide were outliers for Episkin™-KeraSkin™, and EpiDerm™- and 
LabCyte EPI-MODEL24-KeraSkin™ pairs, reflecting that KeraSkin™ SIT 
may be more sensitive. In the Kappa statistics (Table 5), Episkin™- 
KeraSkin™ showed 0.608, i.e., substantial concordance while Epi-
Derm™- and LabCyte EPI-MODEL24-KeraSkin™ were 0.828, almost 
perfect concordances. 
3.3. Predictive capacity of KeraSkin™ SIT for 20 reference chemicals 
The predictive capacity of KeraSkin™ SIT, i.e., sensitivity, specificity 
and accuracy was evaluated with three repeated tests for 20 PS chem-
icals by three laboratories (Table 6). With all 180 individual runs, mis- 
predicted test/total was calculated. The result showed the sensitivity 
of 98.9%, specificity of 70%, and accuracy of 84.4%. Wilson’s 95% 
confidence interval of sensitivity was 0.940–0.998, specificity was 
0.599–0.785 and accuracy was 0.784–0.890. 
3.4. Color interference 
All three participating laboratories performed the correction pro-
cedure for color interference (Fig. 2) for all 20 reference chemicals to 
pre-check the possibility of direct staining and MTT reactivity. No 
correction was necessary for the 20 PS chemicals in Lab 1 and Lab 2, but 
in Lab 3, decision of a chemical, Benzenethiol, 5-(1,1-dimethylethyl)-2- 
methyl was changed from non-irritant to irritant due to a substantial 
MTT reactivity as shown in Table 7. 
3.5. Predictive capacity of KeraSkin™ SIT for 66 chemicals 
Additional 46 reference chemicals (Table 8) with in vivo Draize score 
available were selected and tested to further evaluate the predictive 
capacity of KeraSkin™ SIT. Overall, data for 66 chemicals (46 + 20 PS 
chemicals) was obtained (Fig. 6). KeraSkin™ SIT showed a sensitivity of 
95.2%, specificity of 82.2% and accuracy of 86.4% for those 66 chem-
icals, which met OECD PS criteria. The predictive capacity was further 
calculated for liquids (43) and solids (23) separately, which also satis-
fied the criteria (liquids; sensitivity of 100%, specificity of 73.1% and 
accuracy of 83.7%, and solids; 75%, 94.7%, 91.3%) (see Table 9). 
The ROC curve analysis was conducted with viability data for 66 
chemicals. As shown in Fig. 7, cut-off value of 60–80% was optimum for 
solids while 45–75% was for liquids. Overall, optimal cut-off values 
were determined to be 60–75%. The AUC was estimated to be 0.7235, 
whichrepresents a reasonable level of predictive capacity (Park et al., 
2004; Youngstrom, 2014). Also, the weighted approach analysis was 
conducted to correct unequal number of test results across chemicals, 
which showed the sensitivity 94.7%, specificity 82.2%, and accuracy 
86.2% (Table 10). 
4. Discussion 
This study was conducted to determine if the KeraSkin™ SIT satisfies 
the performance standards (PS) of OECD TG 439. The assessment of 
reproducibility and predictive capacity of the KeraSkin™ SIT test 
method was performed using 20 PS chemicals and additional 46 
chemicals were tested to further assess the predictive capacity. Ker-
aSkin™ SIT has achieved a WLR of 95%–100% in all three participating 
Table 4 
Intra-class correlation coefficient of the test results between the KeraSkin™ SIT 
and validated reference methods. 
KeraSkin™ SIT Intra-class correlation 
coefficient (ICC) 
95% confidence 
interval 
Lower 
bound 
Upper 
bound 
Episkin™ Single 
measures 
.945 .867 .978 
Average 
measures 
.972 .929 .989 
EpiDerm™ SIT Single 
measures 
.934 .841 .973 
Average 
measures 
.966 .914 .986 
LabCyte 
EPI- 
MODEL24 
SIT 
Single 
measures 
.986 .966 .995 
Average 
measures 
.993 .983 .997 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
9
Fig. 5. Bland-Altman plots for evaluating the agreement between KeraSkin™ SIT and existing VRMs of OECD TG 439. Bland-Altman plots for a) EpiSkin™-Ker-
aSkin™ SIT, b) EpiDerm™ SIT-KeraSkin™ SIT, c) LabCyte EPI-MODEL24 SIT- KeraSkin™ SIT. Dotted lines indicate 95% confidence limits. Blue circle represents 
liquid chemicals and red circle, solids. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
10
laboratories and a BLR of 100% in the three participating laboratories. 
Predictive capacity of the KeraSkin™ SIT (sensitivity, specificity and 
accuracy: 98.9%, 70%, 84.4%) for 20 reference chemicals was also 
better or equal to the PS criteria (sensitivity, specificity and accuracy: 
≥80%, ≥70% and ≥75%). With additional chemicals, predictive ca-
pacity was a sensitivity of 95.2%, a specificity of 82.2% and an accuracy 
of 86.4% for 66 chemicals (20 PS chemicals + 46 additional chemicals). 
There were 4 false predictions for the non-irritant chemicals. 1- 
bromo-4-chloroethylene (no. 11), 4-methyl-thio-benzaldehyde (no. 
16), cinnamaldehyde (no. 20) were incorrectly predicted in all three 
laboratory. This is considered to be due to the characteristic of the 
materials or limitation of RhE SITs since other VRMs like EpiDerm™ SIT 
and LabCyte EPI-MODEL24 SIT methods also predicted them incorrectly 
(Kandárová et al., 2009; Katoh and Hata, 2011; Spielmann et al., 2007). 
Interestingly, KeraSkin™ SIT can predict the irritant, Di-n-propyl 
disulphide, correctly except for one run by Lab 3. Di-n-propyl disul-
phide, along with 1-decanol, is Category 2 chemical and it was 
frequently predicted as a false negative in VRMs, suggesting that Ker-
aSkin™ SIT may be more sensitive than existing VRMs. By correctly 
identifying Di-n-propyl disulphide, the sensitivity of KeraSkin™ SIT was 
higher than VRMs for 20 reference chemicals, exceeding the sensitivity 
Table 5 
Cohen’s Kappa statistics of the results between the KeraSkin™ SIT and the 
validated reference methods in OECD TG 439. 
comparison Kappa coefficient P- value 
EpiSkin™ SIT .608 .007 
EpiDerm™ SIT .828 .000 
LabCyte EPIMODEL24 SIT .828 .000 
Table 6 
Predictive capacity of KeraSkin™ SIT for 20 reference chemicals in OECD TG 
439 PS. 
PS criteria Predictive capacity 
BS (60) KCL (60) COSMAX (60) Total(180) 
Sensitivity ≥80% 100% 100% 96.7% 98.9% 
Specificity ≥70% 70% 70% 70% 70% 
Accuracy ≥75% 85% 85% 83.3% 84.4% 
Lab1; Biosolution Co., Lab2; Korea Conformity Laboratories, Lab3; COSMAX. 
Table 7 
Color Interference check for 20 PS reference chemicals in COSMAX. 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
11
criteria of PS, which was confirmed again with 66 chemicals (sensitivity 
of 95.2%). 
Comparison of KeraSkin™ SIT with VRMs of OECD TG 439 revealed 
that KeraSkin™ SIT agrees well with VRMs with respect to irritant de-
cisions and raw data (viability). Various statistical analyses were 
conducted, such as ICC, Bland-Altman plot and Kappa statistics. Pair- 
wise comparison with all VRMs using these statistical methods all sug-
gests a moderate or high correlation between KeraSkin™ SIT and 
existing VRMs. Interestingly, KeraSkin™ SIT was more similar to Epi-
Derm™ and LabCyte EPI-MODEL24 than Episkin™ as determined by 
Table 8 
List of 46 substances for further evaluation. 
No Chemicals CASN In vivo Class (In 
vivo Cat.) 
Physical 
Status 
Storage 
condition 
Manufacturer Cat No. Purity 
(%) 
1 Phenylethyl alcohol (2-Phenylethanol) 60-12-8 NI (NC) liquid RT Sigma 77861 99.7 
2 Isopropyl myristate 110-27-0 NI (NC) liquid RT Sigma 172472 98.0 
3 Triethylene glycol 112-27-6 NI (NC) liquid RT Sigma T59455 99.3 
4 4,4′-Methylenebis (2,6-di-tert-butylphenol) 118-82-1 NI (NC) solid RT Sigma 277924 99.6 
5 Benzyl benzoate 120-51-4 NI (NC) liquid RT Sigma B6630 99.8 
6 3-Chloronitrobenzene (1-Chloro-3-nitrobenzene) 121-73-3 NI (NC) solid RT Sigma 218758 99.8 
7 Isopropyl palmitate 142-91-6 NI (NC) liquid RT Sigma W515604 99.9 
8 Dodecanoic acid (Lauric acid) 143-07-7 NI (NC) solid RT Sigma W261408 98.0 
9 Sodium bicarbonate 144-55-8 NI (NC) solid RT Sigma S5761 100.0 
10 4-Amino-1,2,4-triazole (4-Amino-4H-1,2,4-triazole) 584-13-4 NI (NC) solid RT Sigma A81803 99.8 
11 3,3′-Dithiodipropionic acid 1119-62-6 NI (NC) solid RT Sigma 109010 99.8 
12 Silane A-1430 ((3-Chloro propyl) trimethoxy silane) 2530-87-2 NI (NC) liquid RT Sigma 440183 97.5 
13 2,5-Dimethyl-4-oxo-4,5-dihydrofuran-3-yl acetate (4- 
Acetoxy-2,5-dimethyl-3(2H)-furanone) 
4166-20-5 NI (NC) liquid RT TCI A2192 98.1 
14 Di-propylene glycol 25265-71- 
8 
NI (NC) liquid RT Sigma D215554 99.0 
15 Polyethylene glycol 400 25322-68- 
3 
NI (NC) liquid RT TCI N0443 – 
16 Dipropylene glycol monobutyl ether (DPnB) 
(Di(propylene glycol) butyl ether, mixture of isomers) 
29911-28- 
2 
NI (NC) liquid RT Sigma 484237 99.1 
17 Propylene glycol 57-55-6 NI (NC) liquid RT Sigma P4347 100.0 
18 1-(4-Chlorophenyl)-3-(3,4-dichlorophenyl)urea 101-20-2 NI (NC) solid RT Sigma 105937 98.9 
19 Erucamide 112-84-5 NI (NC) solid − 18 ◦C Sigma 280577 88.4 
20 Benzyl salicylate 118-58-1 NI (NC) liquid RT Sigma 84260 99.9 
21 3.3-Dimethylpentane 562-49-2 NI (NC) liquid RT ACROS 152660050 99.6 
22 Tetrabromophenol blue 4430-25-5 NI (NC) solid RT Sigma 199311 80.0 
23 2-Ethylhexyl 4-methoxycinnamate 5466-77-3 NI (NC) liquid RT Sigma 78848 98.9 
24 Sodium bisulphite 7631-90-5 NI (NC) solid RT Sigma 243973 – 
25 2-(Formylamino)-3-thiophenecarboxylic acid 43028-69- 
9 
NI (NC) solid RT Sigma S935344 100.0 
26 Methyl laurate (at 37 ◦C, liquid) 111-82-0 NI (Cat 3) liquid RT Sigma 234591 99.9 
27 Benzophenone-3 131-57-7 NI (NC) solid RT Sigma H36206 99.5 
28 3-Chloro-4-fluoronitrobenzene (below 25 ◦C, solid) 350-30-1 NI (NC) solid RT Sigma 233234 99.2 
29 Cyclohexadecanone (below 37 ◦C, solid) 2550-52-9 NI (NC) solid RT Symrise 600359 – 
30 2-Phenylhexanenitrile3508-98-3 NI (Cat 3) liquid RT IFF 190979 – 
31 2-Ethylhexyl-4-aminobenzoate 26218-04- 
2 
NI (NC) solid 4 ◦C Santacruz SC-496103 96.0 
32 Capryl-isostearate 209802- 
43-7 
NI (NC) liquid RT Nikko chemical S028795 100.0 
33 Diisopropyl sebacate 7491-02- 
03 
NI (NC) liquid RT Nippon Fine 
Chemicals 
– 100.0 
34 Barium sulfate 7727-43-7 NI (NC) solid RT/ 
hygroscopic 
ACROS 222515000 99.2 
35 10% Xanthan gum (in D.W.) 11138-66- 
2 
NI (NC) gel RT/ 
hygroscopic 
Sigma 43708 – 
36 1-Bromopentane 110-53-2 I (Cat 2) liquid RT Sigma 117811 99.5 
37 Nonanoic acid (at 37 ◦C, liquid) 112-05-0 I (Cat 2) liquid 2–8 ◦C Sigma N5502 97.0 
38 Decanoic acid (at 37 ◦C, liquid) 334-48-5 I (Cat 2) liquid RT Sigma W236403 99.3 
39 Butyl methacrylate 97-88-1 I (Cat 2) liquid RT Sigma 235865 99.4 
40 alpha-Terpineol 98-55-5 I (Cat 2) liquid RT Sigma W304506 97.9 
41 Butyric acid 107-92-6 I (Cat 1B) liquid RT Sigma B103500 99.8 
42 Heptanoic acid 111-14-8 I (R34) liquid RT Sigma 75190 99.9 
43 Octanoic acid (Caprylic acid) 124-07-2 I (Cat 1B/1C) liquid RT Sigma C2875 99.9 
44 25% Cetrimonium chloride (in D.W.) 112-02-7 I (Cat 2) liquid RT Sigma 292737 – 
45 Ferric chloride 7705-08-0 I (R34) solid RT Sigma 157740 97.0 
46 Piroctone olamine 68890-66- 
4 
I (Cat 2) solid 4 ◦C TCI P2178 99.8 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
12
ICC and Kappa statistics. More importantly, Bland-Altman plot revealed 
that KeraSkin™ SIT appears to be more sensitive overall than existing 
VRMs, which can explain the higher sensitivity than VRMs. These points 
could not be seen with a simple point estimation of predictive capacity, 
reflecting the utility of statistical methods for evaluating and comparing 
me-too tests with existing gold standards as also exemplified by the 
recent studies (Alépée et al., 2016; Lim et al., 2019a). 
Color interference is a potential issue for all colorimetric assay-based 
in vitro tests (Joo et al., 2019). We also confirmed that decision can be 
changed for a non-irritant with color interference after correction. 
However, the correction for color interference was seldom necessary 
since color-interfering chemicals persistent through vigorous washing 
steps are not common and irritant decisions are not affected by 
correction. One run with one chemical needed correction among 180 
runs in the main study. And among 46 chemicals tested for additional 
predictive capacity assessment, 10 chemicals showed concerns of color 
interference but correction did not change the decision (data not 
shown). Against this background, we suggest that correction for color 
interference may be reserved only for non-irritants as a post-check 
rather than doing pre-check for all chemicals as shown in Fig. 2. This 
will significantly save costly RhE tissues for color checking. 
Collectively, we demonstrated that KeraSkin™ SIT showed good 
reproducibility and predictive capacity, meeting the criteria of OECD TG 
439 PS. The results of the validation study which was conducted in 
accordance of OECD TG 439 PS, support that KeraSkin™ SIT is 
compatible with other existing VRMs of OECD TG 439 in terms of 
reproducibility, predictive capacity and applicability domain. There-
fore, we believe that KeraSkin™ SIT may be considered as a me-too 
method similar to RhE SIT VRMs of OECD TG 439. 
Funding sources 
This research was supported by grants from the Ministry of Food and 
Drug Safety in 2018 (18182MFDS463) and National Research Founda-
tion (NRF) funded by Ministry of Science and ICT (MSIT) 
(2018R1A5A2025286). 
Fig. 6. KeraSkin™ SIT viability data for 66 chemicals composed of 23 solids and 43 liquids. Red color indicates falsely predicted substances. Dotted line indicates 
cut-off (50%) and blue line represents 60%, an optimum cut-off suggested by ROC analysis. (For interpretation of the references to color in this figure legend, the 
reader is referred to the Web version of this article.) 
Table 9 
Predictive capacity of KeraSkin™ SIT for 66 chemicals. 
Total (66) Liquid (43) Solid (23) PS 
I NI I NI I NI 
I 20 1 17 0 3 1 
NI 8 37 7 19 1 18 
Total 66 43 23 
Sensitivity 95.2% 100% 75% 80% 
Specificity 82.2% 73.1% 94.7% 70% 
Accuracy 86.4% 83.7% 91.3% 75% 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
13
Fig. 7. ROC curve for 66 substances and 23 solids and 43 liquids. The receiver operating characteristic (ROC) curve was drawn with cell viability data obtained with 
KeraSkin™ SIT. Grey color is the current cut-off, 50% and yellow color is the optimum cut-off value suggested from the analysis. (For interpretation of the references 
to color in this figure legend, the reader is referred to the Web version of this article.) 
Table 10 
Comparison of weighted and unweighted predictive capacity of KeraSkin™ SIT. 
Predictive capacity Unweighted PS criteria Weighted 
Total (66) Liquid (43) Solid (23) Total (66) Liquid (43) Solid (23) 
Sensitivity 95.2% 100% 75% ≥80% 94.7% 99.3% 75% 
Specificity 82.2% 73.1% 94.7% ≥70% 82.2% 73.1% 94.7% 
Accuracy 86.4% 83.7% 91.3% ≥75% 86.2% 83.5% 91.3% 
J. Han et al. 
Regulatory Toxicology and Pharmacology 117 (2020) 104725
14
Declaration of competing interest 
The authors declare that they have no known competing financial 
interests or personal relationships that could have appeared to influence 
the work reported in this paper. 
Acknowledgements 
This research was supported by grants from the Ministry of Food and 
Drug Safety in 2018 (18182MFDS463) and National Research Founda-
tion (NRF) funded by Ministry of Science and ICT (MSIT) 
(2018R1A5A2025286). 
References 
Adler, S., et al., 2011. Alternative (non-animal) methods for cosmetics testing: current 
status and future prospects—2010. Arch. Toxicol. 85, 367–485. 
Akbarsha, M.A., Mascarenhas, B., 2019. Cosmetic Regulation and Alternatives to Animal 
Experimentation in India. Alternatives to Animal Testing. Springer, Singapore, 
pp. 57–62. 
Alépée, N., et al., 2016. Multi-laboratory evaluation of SkinEthic HCE test method for 
testing serious eye damage/eye irritation using solid chemicals and overall 
performance of the test method with regard to solid and liquid chemicals testing. 
Toxicol. Vitro 34, 55–70. 
Atkinson, G., 1998. Statistical methods for assessing measurement error (reliability) in 
variables relevant to sports medicine. Sports Med. 26, 217–238. 
Bartko, J.J., 1976. On various intraclass correlation reliability coefficients. Psychol. Bull. 
83, 762. 
Berry, K.J., et al., 1988. A generalization of Cohen’s kappa agreement measure to 
interval measurement and multiple raters. Educ. Psychol. Meas. 48, 921–933. 
Blackman, N.J.M., 2000. Interval estimation for Cohen’s kappa as a measure of 
agreement. Stat. Med. 19, 723–741. 
Choi, J., et al., 2014. Skin corrosion and irritation test of sunscreen nanoparticles using 
reconstructed 3D human skin model. Environ. Toxicol. 29. 
Cooper 2nd, J., Saracci, R., 1979. Describing the validity of carcinogen screening tests. 
Br. J. Cancer. 39, 87. 
Giavarina, D., 2015. Understanding bland altman analysis. Biochem. Med. 25, 141–151. 
Hirschmann, M., et al., 2011. The position and orientation of total knee replacement 
components: a comparison of conventional radiographs, transverse 2D-CT slices and3D-CT reconstruction. J. Bone Joint Surg. 93, 629–633. 
Hwang, J.-h., et al., 2018. Investigation of dermal toxicity of ionic liquids in monolayer- 
cultured skin cells and 3D reconstructed human skin models. Toxicol. Vitro 46, 
194–202. 
Joo, K.M., et al., 2019. Development and validation of UPLC method for WST-1 cell 
viability assay and its application to MCTT HCE eye irritation test for colorful 
substances. Toxicol. Vitro 60, 412–419. 
Jung, K.-M., et al., 2014. KeraSkin™-VM: a novel reconstructed human epidermis model 
for skin irritation tests. Toxicol. Vitro 28, 742–750. 
Kandárová, H., et al., 2009. In vitro skin irritation testing: improving the sensitivity of 
the EpiDerm skin irritation test protocol. Altern. Lab. Anim. 37, 671–689. 
Katoh, M., Hata, K., 2011. Refinement of LabCyte EPI-MODEL24 skin Irritation test 
method for adaptation to the requirements of OECD test guideline 439. Altern. Anim. 
Test. Exp. 16, 111–122. 
Kim, K., et al., 2017. Anti-pigmentary effect of (-)-4-Hydroxysattabacin from the marine- 
derived Bacterium Bacillus sp. Mar. Drugs 15, 138. 
Koo, T.K., Li, M.Y., 2016. A guideline of selecting and reporting intraclass correlation 
coefficients for reliability research. J. Chiropr. Med. 15, 155–163. 
Kose, O., et al., 2018. Evaluation of skin irritation potentials of different cosmetic 
products in Turkish market by reconstructed human epidermis model. Regul. 
Toxicol. Pharmacol. 98, 268–273. 
Landis, J.R., Koch, G.G., 1977. The measurement of observer agreement for categorical 
data. Biometrics 33, 159–174. 
Lee, M., et al., 2017. Alternatives to in vivo draize rabbit eye and skin irritation tests with 
a focus on 3D reconstructed human cornea-like epithelium and epidermis models. 
Toxicol. Res. 33, 191–203. 
Lee, S.-H., et al., 2016. Evaluating the micronucleus induction potential for the 
genotoxicity assay using the human skin model, KeraSkin TM. J Soc. Cosmet. Sci. 
Korea 42, 211–216. 
Lim, S.-E., et al., 2019a. Statistical analysis of the reproducibility and predictive capacity 
of MCTT HCE™ eye irritation test, a me-too test method for OECD TG 492. Regul. 
Toxicol. Pharmacol. 107, 104430. 
Lim, S.E., et al., 2019b. Me-too validation study for in vitro eye irritation test with 3D- 
reconstructed human cornea epithelium, MCTT HCE(TM). Toxicol. Vitro 55, 
173–184. 
OECD, 2002. Test No. 404 : Acute Dermal Irritation/Corrosion. OECD Publishing, paris, 
p. 13. 
OECD, 2010. Test no. 439: in vitro skin irritation. In: OECD. OECD Publishing, Paris. 
OECD, 2015a. In Vitro Skin Irritation: Reconstructed Human Epidermis Test Method, vol. 
439. OECD Publishing, paris. 
OECD, 2015b. No. 220 performance standards for the assessment of proposed similar or 
modified in vitro reconstructed human epidermis (RhE) test methods for skin 
irritation testing as described in TG 439. In: OECD. OECD Publishing, Paris. 
OECD, 2017. Guidance document on an integrated approach on testing and assessment 
(IATA) for skin corrosion and irritation. In: OECD. OECD Publishing, Paris. 
OECD, 2019a. Test No. 439: In Vitro Skin Irritation: Reconstructed Human Epidermis 
Test Method. OECD Publishing, Paris. 
OECD, 2019b. Reports of the Peer Reviews of the epiCS and Skin+ Test Methods in View 
of Their Inclusion in Test Guideline 439 on In Vitro Skin Irritation. 
Parish, S.T., et al., 2020. An evaluation framework for new approach methodologies 
(NAMs) for human health safety assessment. Regul. Toxicol. Pharmacol. 112, 
104592. 
Park, J., et al., 2018. Mixture toxicity of methylisothiazolinone and propylene glycol at a 
maximum concentration for personal care products. Toxicol. Res. 34, 355–361. 
Park, S.H., et al., 2004. Receiver operating characteristic (ROC) curve: practical review 
for radiologists. Korean J. Radiol. 5, 11–18. 
Shrout, P.E., Fleiss, J.L., 1979. Intraclass correlations: uses in assessing rater reliability. 
Psychol. Bull. 86, 420. 
Spielmann, H., et al., 2007. The ECVAM international validation study on in vitro tests 
for acute skin irritation: report on the validity of the EPISKIN and EpiDerm assays 
and on the Skin Integrity Function Test. Altern. Lab. Anim. 35, 559–601. 
Szklo, M., Nieto, F.J., 2014. Epidemiology: Beyond the Basics. Jones & Bartlett 
Publishers. 
Yang, H., et al., 2017. Prevalidation trial for a novel in vitro eye irritation test using the 
reconstructed human cornea-like epithelial model, MCTT HCE™. Toxicol. Vitro. 39, 
58–67. 
Youngstrom, E.A., 2014. A primer on receiver operating characteristic analysis and 
diagnostic efficiency statistics for pediatric psychology: we are ready to ROC. 
J. Pediatr. Psychol. 39, 204–221. 
J. Han et al. 
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref1
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref1
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref4
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref4
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref5
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref5
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref6
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref6
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref7
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref7
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref8
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref8
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref9
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref9
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref10
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref14
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref14
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref15
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref15
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref17
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref17
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref18
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref18
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref20
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref20
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref25
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref25
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref26
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref27
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref27
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref29
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref29
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref30
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref30
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref31
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref31
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref33
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref33
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref34
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref34
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref35
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref35
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref37
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref37
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39
http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39
	Me-too validation study for in vitro skin irritation test with a reconstructed human epidermis model, KeraSkin™ for OECD te ...
	1 Introduction
	2 Material and methods
	2.1 Chemicals and reagents
	2.2 KeraSkin™ SIT protocol 1.5
	2.2.1 Washing
	2.2.2 MTT assay
	2.2.3 Detecting and correcting interference from colored chemicals and MTT reducers
	2.2.4 Prediction model for the KeraSkin™ SIT
	2.2.5 Acceptance criteria
	2.3 Statistical analysis for the reliability of the KeraSkin™ SIT
	2.3.1 Data collection and chart
	2.3.2 Within-laboratory and between-laboratory reproducibility
	2.3.3 Correlation with validated reference methods
	2.4 Predictive capacity of the KeraSkin™ SIT
	2.4.1 20 chemicals and 46 additional chemicals for the evaluation of predictive capacity
	3 Results
	3.1 Within-and between-laboratory reproducibility
	3.2 Correlation of KeraSkin™ SIT with validated reference methods of OECD TG 439
	3.3 Predictive capacity of KeraSkin™ SIT for 20 reference chemicals
	3.4 Color interference
	3.5 Predictive capacity of KeraSkin™ SIT for 66 chemicals
	4 Discussion
	Funding sources
	Declaration of competing interest
	Acknowledgements
	References

Continue navegando