Baixe o app para aproveitar ainda mais
Prévia do material em texto
Regulatory Toxicology and Pharmacology 117 (2020) 104725 Available online 5 August 2020 0273-2300/© 2020 Elsevier Inc. All rights reserved. Me-too validation study for in vitro skin irritation test with a reconstructed human epidermis model, KeraSkin™ for OECD test guideline 439 Juhee Han a, Seolyeong Kim b, Su-Hyun Lee b, Jin-Sik Kim c, Yu Jin Chang d, Tae-Cheon Jeong e, Mi-Jeong Kang e, Tae-Sung Kim f, Hae Seong Yoon f, Ga Young Lee f, SeungJin Bae a,**, Kyung-Min Lim a,* a College of Pharmacy, Ewha Womans University, Seoul, Republic of Korea b Biosolution Co, Seoul, Republic of Korea c COSMAX Korea, Seongnam-si, Republic of Korea d Korea Conformity Laboratories, Incheon, Republic of Korea e College of Pharmacy, Yeungnam University, Gyeongsan, Republic of Korea f National Institute of Food and Drug Safety Evaluation, Ministry of Food and Drug Safety, Osong, Republic of Korea A R T I C L E I N F O Keywords: KeraSkinTM Reconstructed human epidermis OECD TG 439 Reproducibility Predictive capacity Skin irritation A B S T R A C T We conducted a me-too validation study to confirm the reproducibility, reliability, and predictive capacity of KeraSkin™ skin irritation test (SIT) as a me-too method of OECD TG 439. With 20 reference chemicals, within- laboratory reproducibility (WLR) of KeraSkin™ SIT in the decision of irritant or non-irritant was 100%, 100%, and 95% while between-laboratory reproducibility (BLR) was 100%, which met the criteria of performance standard (PS, WLR≥90%, BLR≥80%). WLR and BLR were further confirmed with intra-class correlation (ICC, coefficients >0.950). WLR and BLR in raw data (viability) were also shown with a scatter plot and Bland-Altman plot. Comparison with existing VRMs with Bland-Altman plot, ICC and kappa statistics confirmed the compat- ibility of KeraSkin™ SIT with OECD TG 439. The predictive capacity of KeraSkin™ SIT was estimated with 20 reference chemicals (the sensitivity of 98.9%, the specificity of 70%, and the accuracy of 84.4%) and additional 46 chemicals (for 66 chemicals [20 + 46 chemicals, the sensitivity, specificity and accuracy: 95.2%, 82.2% and 86.4%]). The receiver operating characteristic (ROC) analysis suggested a potential improvement of the pre- dictive capacity, especially sensitivity, when changing cut-off (50% → 60–75%). Collectively, the me-too vali- dation study demonstrated that KeraSkin™ SIT can be a new me-too method for OECD TG 439. 1. Introduction The European Commission has prohibited animal experiments for cosmetics in 2013 and has been working on developing alternative test methods to replace animal testing based on the ‘3R’s Principle’ (Adler et al., 2011). Since the EU ban on animal testing for cosmetics, more than 37 countries worldwide have legally prohibited animal experi- ments for the development of cosmetics (Akbarsha and Mascarenhas, 2019). In addition, there is an increasing demand for new approach methodologies (NAMs) to evaluate the safety of numerous chemicals on human health in various sectors (Parish et al., 2020). In vitro skin irri- tation test, OECD TG 439, has been developed to replace OECD TG 404 (OECD, 2002) in which rabbits are used as a test species to evaluate the skin irritancy of chemicals and cosmetic products (Kose et al., 2018; Park et al., 2018). OECD TG 439 uses a reconstructed human epidermis (RhE) (OECD, 2015a) and can be used as a stand-alone to identify UN GHS No category chemical, i.e., non-irritant or in combination with other replacement methods, such as OECD TG 435 “In Vitro Membrane Barrier Test Method for Skin Corrosion” (OECD, 2015b) to further classify the hazard on skin in the framework of Integrated Approach on Abbreviations: OECD, The Organisation for Economic Co-operation and Development; TG, Test guideline; PS, Performance Standards; VRM, Validated Reference Method; SIT, skin irritation test; WLR, Within-laboratory reproducibility; BLR, Between-laboratory reproducibility; RhE, Reconstructed human Epidermis; CV, cell viability; I, irritant; NI, non-irritant; GHS, Globally Harmonized System of Classification and Labelling of Chemicals. * Corresponding author. ** Corresponding author. E-mail addresses: sjbae@ewha.ac.kr (S. Bae), kmlim@ewha.ac.kr (K.-M. Lim). Contents lists available at ScienceDirect Regulatory Toxicology and Pharmacology journal homepage: www.elsevier.com/locate/yrtph https://doi.org/10.1016/j.yrtph.2020.104725 Received 4 March 2020; Received in revised form 25 June 2020; Accepted 2 July 2020 mailto:sjbae@ewha.ac.kr mailto:kmlim@ewha.ac.kr www.sciencedirect.com/science/journal/02732300 https://www.elsevier.com/locate/yrtph https://doi.org/10.1016/j.yrtph.2020.104725 https://doi.org/10.1016/j.yrtph.2020.104725 https://doi.org/10.1016/j.yrtph.2020.104725 http://crossmark.crossref.org/dialog/?doi=10.1016/j.yrtph.2020.104725&domain=pdf Regulatory Toxicology and Pharmacology 117 (2020) 104725 2 Testing and Assessment (IATA) for Skin Corrosion and Irritation (OECD, 2017). Performance standards (PS) of OECD TG 439 for the assessment of similar and modified RhE-based test methods, OECD guidance docu- ment 220 (OECD, 2015b) was developed to encourage the introduction of new ‘‘me-too’’ RhE models and methods. At the time of original adoption of OECD TG 439 (OECD, 2010), EpiDerm™ SIT and EpiSkin™ SIT were only two validated reference methods (VRMs). Since then, various me-too skin irritation tests like SkinEthic™ RhE SIT and Labcyte EPI-MODEL24 SIT were developed, validated and approved for OECD TG 439 in 2013. In 2019, OECD TG 439 was updated with two additional me-too test methods, epiCS® and Skin+® (OECD, 2019a; 2019b). KeraSkin™ is a newly developed RhE using human primary kerati- nocytes to evaluate skin irritation of chemicals (Jung et al., 2014). Several studies have demonstrated that KeraSkin™ can be applied to the skin toxicity in various settings (Choi et al., 2014; Hwang et al., 2018; Jung et al., 2014; Kim et al., 2017; Lee et al., 2016, 2017). Here, we conducted a me-too validation test to confirm the reproducibility, and predictive capacity of KeraSkin™ SIT as a me-too method’ of OECD TG 439 in accordance of OECD TG 439 PS with 20 reference chemicals. Additional 46 chemicals were tested to further evaluate the predictive capacity of KeraSkin™ SIT. In addition to the evaluation of WLR, BLR and predictive capacity based on proportion of concordance of decisions, various statistical methods have been used to assess the WLR and BLR, and compatibility to existing VRMs (Alépée et al., 2016; Lim et al., 2019a, 2019b). Intra-class Correlation Coefficient (ICC) is a commonly used indicator to assess reproducibility and is an estimate of the fraction of the total measurement variability caused by individual appreciator variation (Szklo and Nieto, 2014). It was also used in the analysis of WLR and BLR (Hirschmann et al., 2011). ICC values less than 0.5 are indicative of poor reliability, values between 0.5 and 0.75 indicate moderate reliability, values between 0.75 and 0.9 indicate good reliability, and values greater than 0.90 indicate excellent reliability (Koo and Li, 2016). The Bland-Altman plot is also recommended for comparative studies by showing the relationship between the mean and the difference in values obtained from each test result. It is very useful in examining patterns of inconsistencies between measurements (Atkinson, 1998). Bland-Altman plot shows three horizontal lines parallel to the x-axis, representing the mean differences between the measured values, and ±95% limit of agreement. Bland-Altman plot was recommended that 95% of the data points should lie within ±95% limit of agreement (Giavarina, 2015). In addition, Cohen’s kappa statistics is often used to measure the concordance between two observers for normal scales (Berry et al., 1988; Blackman, 2000). Kappa statistics can vary from − 1 to 1 (Blackman, 2000; Landisand Koch, 1977). The kappa value less than 0.00 indicates poor, 0.00–0.20, slight, 0.21–0.40, fair and 0.41–0.60, moderate, 0.61–0.80, substantial and 0.81–1.00, almost perfect concordance (Landis and Koch, 1977). For the assessment of optimal cutoffs, receiver operating character- istic (ROC) curve analysis is an effective method to assess quality or performance (Park et al., 2004; Yang et al., 2017). In addition, to pro- vide 95% confidence interval for predictive capacity with a small sample number, Wilson’s score confidence interval was used (Alépée et al., 2016). With the various statistical analysis, we could demonstrated that KeraSkin™ SIT exhibits good reproducibility and predictive capacity that can meet the criteria of OECD TG 439 PS, which could contribute to reducing animal tests by providing with another new method to replace the rabbit skin irritation test. 2. Material and methods 2.1. Chemicals and reagents The 20 reference chemicals as suggested in the OECD TG 439 per- formance standards were used as described in Table 1. 20 chemicals consisted of 10 non-irritants (top 10 in Table 1) and 10 irritants (bottom 10 in Table 1) according to the UN GHS category, of which 4 were solids and 16, liquids. Also, 46 chemicals (19 solids and 27 liquids) were selected from previous validation study reports and papers to further Table 1 20 reference chemicals recommended from GD 220 published in 2015. No. Chemicals CASN UN GHS Cat. Physical Status Storage condition Manufacturer Cat No. Purity (%) Non-classified chemicals 1 1-Bromo-4-chlorobutane 6940-78- 9 No Cat. liquid RT Sigma B60800 99.0 2 Diethyl phthalate 84-66-2 No Cat. liquid RT Sigma 524972 99.9 3 Naphthalene acetic acid 86-87-3 No Cat. solid RT Sigma N0640 97.0 4 Allyl phenoxy-acetate 7493-74- 5 No Cat. liquid RT Sigma W203807 99.8 5 Isopropanol 67-63-0 No Cat. liquid RT Sigma W292907 100.0 6 4-Methyl-thio-benzaldehyde 3446-89- 7 No Cat. liquid RT Sigma 222771 98.9 7 Methyl stearate 112-61-8 No Cat. solid RT Sigma 85769 99.2 8 Heptyl butyrate 5870-93- 9 No Cat. (Optional Cat. 3) liquid RT Sigma W254908 99.4 9 Hexyl salicylate 6259-76- 3 No Cat. (Optional Cat. 3) liquid RT Sigma W520306 99.5 10 Cinnamaldehyde 104-55-2 No Cat. (Optional Cat. 3) liquid RT Sigma W228613 99.1 Classified chemicals 11 1-Decanol 112-30-1 Cat. 2 liquid RT Sigma 239763 99.1 12 Cyclamen aldehyde 103-95-7 Cat. 2 liquid RT TCI I0377 97.8 13 1-bromohexane 111-25-1 Cat. 2 liquid RT Sigma B68240 99.3 14 2-Chloromethyl-3,5-dimethyl-4- methoxypyridine HCL 86604- 75-3 Cat. 2 solid RT Sigma 535508 ≥99.9 15 Di-n-propyl disulphide 629-19-6 Cat. 2 liquid RT Sigma 149225 97.8 16 Potassium hydroxide(5% aq.) 1310-58- 3 Cat. 2 liquid RT Sigma 417661 – 17 Benzenethiol, 5-(1,1-dimethylethyl)- 2-methyl 7340-90- 1 Cat. 2 liquid RT ACROS 279960500 93.9 18 1-Methyl-3-phenyl-1-piperazine 5271-27- 2 Cat. 2 solid RT AK scientific A083 99.0 19 Heptanal 111-71-7 Cat. 2 liquid RT Sigma W254002 97.7 20 Tetrachloroethylene 127-18-4 Cat. 2 liquid RT Sigma 371696 100.0 J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 3 assess predictive capacity. In total, 66 chemicals composed of 23 solids and 43 liquids (21 irritants and 45 non-irritants) were tested in further evaluating the predictive capacity of KeraSkin™ SIT. Participating lab- oratories (Biosolution Co, Seoul, Korea, COSMAX Korea, Seongnam-si, Korea and Korea Conformity Laboratories, Incheon, Korea) tested all the substances under the blind conditions. 2.2. KeraSkin™ SIT protocol 1.5 Overall procedure of KeraSkin™ SIT was described in Fig. 1. Original protocol of KeraSkin™ SIT was developed in 2012, which was estab- lished through adopting the existing VRMs of OECD TG 439. Five times of revisions were made to optimize KeraSkin™ SIT protocol and to correct mis-predictions for some PS chemicals with respect to washing procedure (employment of squeeze bottle and cotton-swab), removal of mesh application, treatment volume (from 30 to 40 μL) and treatment time (from 40 min to 30 min). Viscous substance sometimes made the mesh stick to the tissue even after the washing process, so the mesh was removed. Instead the treat- ment volume was increased (30 μL→40 μL) to evenly distribute the substance on the tissue and the treating time (45 min→30 min) was shortened to compensate the increased treatment volume. Post- incubation time of 42 h is the same as the existing VRMs. Briefly, the KeraSkin™ model, which passed the quality control [optical density values of negative control (NC) by MTT (3-(4,5-dime- thylthiazol-2-yl)-2,5-diphenyltetrazolium bromide) assay: 0.5–1.2, ET50: 6.9.0–14.0 h, IC50 for sodium dodecyl sulfate (SDS) 1.5–4.7 mg/ mL and histological examination] of the manufacturer (Biosolution Co. Seoul, Korea) was delivered to the testing labs in a 24-well format on an agarose gel at about 25 ◦C. Actual QC data for the 13 batches used for the main study was NC OD: 0.9 ± 0.1, ET50: 10.7 ± 1.5, IC50: 3.3 ± 0.4 (Mean ± SD). It was confirmed through an oversea shipment that Ker- aSkin™ was stable up to 56 h of delivery as determined by quality control criteria of NC OD and IC50 for SDS (data not shown). Upon receipt of the shipment, culture medium provided by the manufacturer was pre-warmed in a 37 ◦C thermostat for 30 min. As for the pre- incubation step, 900 μL of the pre-warmed medium was added to each well of a 6-well plate using micropipette and the RhE model insert was carefully transferred to a well using forceps. Then the well-plate was pre- incubated at 37 ◦C, 5% CO2 for 22 ± 2 h. To conduct the experiment, 40 Fig. 1. KeraSkinTM SIT protocol 1.5. Overall KeraSkin™ skin irritation test (SIT) procedure. CV, cell viability, NI, non-irritant. J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 4 μL of liquid substance or solution, or 40 mg (with 40 μL DPBS) of solid substance was topically applied on the upper epithelial surface of the model insert (0.6 cm2) after the pre-incubation. The tissue was incu- bated again (37 ◦C, 5% CO2 condition) for 30 ± 1 min (Fig. 2). Then the tissue was washed to remove the test substances and further incubated (37 ◦C, 5% CO2 condition) for 42 ± 2 h. The resulting tissue viability was determined by MTT assay. 2.2.1. Washing Washing procedure for the 6-wells should be done within 20 min. DPBS for washing should be kept warm in a 37 ◦C thermostat before use. The treated tissue was removed from a 37 ◦C, 5% CO2 incubator and DPBS was flowed on the side wall of the insert using a poly wash bottle, and when DPBS was fully filled in the insert, DBPS in the insert was poured to a beaker at once. Washing was repeated 10 times in total. However, if residual material (i.e. oils or highly viscous liquid materials, and solid materials with gelation) was still observed after 5 times of wash, a cotton swab was used to wipe off the residual materials. After washing, DPBS outside of the insert were removed using a sterile gauze. 2.2.2. MTT assay After post-incubation, the media inside and outside of inserts were removed with a pipette. Inserts were transferred into 24-well plates, prefilled with 200 μL of 0.4 mg/mL MTT per a well. Then, 100 μL of MTT solution was further added to the inside of insert. Then the plate was covered with aluminum foil to protect from the light and placed in an incubator (37 ◦C, 5% CO2) for 3 h ± 5 min. Afterincubation, MTT so- lution remaining was removed from the inside/outside of the insert using 200 μL pipette and the tissues were transferred to a new 6-well plate containing 1.9 mL/well of isopropanol. Then additional 100 μL of isopropanol was added to the inside of the insert. Plate was covered from the light using aluminum foil and shaken on a plate shaker for 3 h ± 5 min to extract formazan. After formazan extraction, the extracts inside and outside of the insert were collected and mixed well by pipetting such that formazan crystals were dissolved clearly. 250 μL of extract per a well was transferred into a 96 well plate, for the optical density (OD) measurement at 570 nm. 2.2.3. Detecting and correcting interference from colored chemicals and MTT reducers In order to prevent the possible interference with the MTT reading by colored chemicals, i.e., ‘direct staining’ or ‘MTT reacting’ chemicals, were pre-checked. Though, when a test substance is determined to be an irritant, the correction shall be not necessary since correction always over-predict the classification of skin irritancy by subtracting the cell viability. Accordingly, the color interference corrections may be reserved as optional for the test chemicals with significant concerns. Overall scheme for correcting color interference could be advised as described in Fig. 2. Colored test substance may interfere with OD measurement of MTT formazan by directly staining the tissue. 0.5 mL of deionized water and 40 μL (liquid) or 40 mg (solid) of the test chemical were added to each Eppendorf tube and the mixture was incubated in a cell incubator (37 ± 1 ◦C, 5 ± 1% CO2, 95% RH) for 30 min. The color changes were observed, and if there was a significant color change, correction for color interference with the viability value calculated from OD of the staining control [viability value of a tissue incubated with phenol red free-DMEM instead of MTT solution (%NSCliving)] was necessary. Fig. 2. Color interference post-checking scheme for KeraSkin™ SIT. CV, cell viability, %TT, Uncorrected viability value, %NWliving, viability value of a tissue incubated with phenol red free-DMEM instead of MTT solution, %NSMTT, viability value of reactivity control obtained with OD from the test done with freeze-killed tissues, %NSCKilled, viability value of freeze-killed tissue incubated with phenol red free-DMEM instead of MTT solution. J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 5 Table 2 Within- and between-laboratory reproducibility of the Keraskin™ SIT test results. Fig. 3. Scatter plot of 3 runs for 20 PS reference chemicals by 3 laboratories. Cell viability data from the KeraSkin™ SIT. Scatter plot with the mean± SD (error bar) of 3 runs for each lab. Biosolution (BS), KCL and COSMAX. J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 6 A test substance with MTT reacting activity may reduce MTT directly and form formazan alone. 0.5 mL of MTT solution (0.4 mg/mL) and 40 μL (liquid) or 40 mg (solid) of the test chemical were added to an Eppendorf tube and the mixture was incubated in a cell incubator (37 ◦C, 5% CO2, 95% RH) for 1 h. The color change was observed and if the color of MTT solution changes to blue/purple, correction for color interference with the viability value of reactivity control obtained with OD from the test done with freeze-killed tissues (%NSMTT) was necessary. If a test substance was both direct-staining and MTT-reacting, then the correction needs to be done for both direct-staining and MTT reac- tivity, but the color interference would be subtracted twice. To resolve this, another staining control with freeze-killed tissues [viability value of freeze-killed tissue incubated with phenol red free-DMEM instead of MTT solution (%NSCKilled)] is necessary. 2.2.4. Prediction model for the KeraSkin™ SIT According to the EU and the GHS classification, an irritant is pre- dicted if the mean relative tissue viability of minimum three replicate tissues exposed to the test chemical is reduced to 50% or below of the mean viability of the negative controls. The test chemical was defined as a non-irritant if the tissue viability was higher or equal to 50%. Other- wise, it was determined as an irritant. Since 50% cut-off value is adopted by all existing VRMs of OECD TG 439, KeraSkin™ SIT protocol has been optimized to 50% cut-off. 2.2.5. Acceptance criteria A test is regarded as acceptable unless the results do not fall into the criteria for a re-test, which is adopted for KeraSkin™ SIT from OECD TG 439, 1) If absorbance value of the negative control chemical is less than 0.7 or exceeds 1.6. 2) If a cell viability value of the positive control chemical is higher than 40%. 3) In negative control chemical, positive control chemical, and test chemical: When the standard deviation of the cell viability value of treated tissue in replications exceeds 18%. 4) If the average cell survival rate of the chemical-treated-well is be- tween 45% and 55% (≥45% and ≤55%, borderline chemicals). If the re-test gives borderline value again, the concordance of the decision should be considered. If the original and re-test are concordant in decision, additional re-test is not necessary. If the decision is different, then the third test should be conducted and majority vote approach should be taken for final decision. Acceptance criteria for KeraSkin™ SIT were established referring to existing VRMs of OECD TG 439. 2.3. Statistical analysis for the reliability of the KeraSkin™ SIT 2.3.1. Data collection and chart Cell viability data were obtained for 20 reference chemicals from a total of three repetitions of KeraSkin™ SIT by three laboratories (total 9 runs of data for a chemical were obtained). 46 additional chemicals were tested by two laboratories, Biosolution and KCL (One or two runs of data for a chemical was obtained). All participating laboratories performed the tests and quality assurance in the spirit of Good Laboratory Practice (GLP). The cell viability was shown in the table as mean± SD. Also, the results were plotted on a scatter plot to show the data distribution at a glance, with the cut-off line to identify incorrect decisions. 2.3.2. Within-laboratory and between-laboratory reproducibility Within-laboratory reproducibility (WLR) is an evaluation of the consistency of the results of three replicate tests conducted by a labo- ratory and between-laboratory reproducibility (BLR) is an evaluation of the consistency between the three laboratories using the mean value of results from different independent laboratory tests of the reference chemical. WLR and BLR for 20 chemicals were presented as a percentage. WLR and BLR were also expressed in confidence intervals using Wilson’s score confidence interval method to consider the uncertainty in the value of the point estimate presented. ICC and Bland-Altman methods were also used for the reproducibility analysis. The formula for the ICC is, ICC= Vb VT = Vb Vb + Ve in which Vb = variance between individuals, VT = total variance, which includes both Vb and Ve [Ve = unwanted variance (“error”)] (Szklo and Nieto, 2014). There are two types of ICC: Single measures ICC is an index for the reliability of the ratings for one, typical, single rater while average measures ICC is of different raters averaged together. (Shrout and Fleiss, 1979) The average measures ICC is always higherthan the single measures ICC. (Bartko, 1976) 2.3.3. Correlation with validated reference methods In order to determine the compatibility of KeraSkin™ SIT with validated reference methods (VRMs) of OECD TG 439; EpiDerm™ SIT, EpiSkin™, and LabCyte EPI-MODEL24 SIT, ICC analysis, Bland-Altman plot and Cohen’s kappa analysis were used to evaluate the agreement. 2.4. Predictive capacity of the KeraSkin™ SIT 2.4.1. 20 chemicals and 46 additional chemicals for the evaluation of predictive capacity The mis-predicted test/total was used to determine the predictive capacity (Alépée et al., 2016) where in all test results of each laboratory were used. The judgment are classified as irritant (I) and non-irritant (NI) and concordance of the reference values and test results was shown in 2 × 2 contingency table. The result of 2 × 2 contingency table judgment was calculated with various predictive power indicators (Cooper and Saracci, 1979). Predictive capacity was calculated as an index of sensitivity, specificity, and accuracy. The results were presented as point estimates and 95% confidence intervals were calculated using Wilson’s score confidence interval. According to the OECD TG 439 Performance Standards, the sensitivity, specificity, and accuracy shall be 80%, 70%, and 75%, respectively or higher. The ROC curve was used to evaluate the predictive capacity of Ker- aSkin™ SIT. The additional predictive capacity assessment consisted of Table 3 Intra-class correlation coefficient of the test results of 3 repetitions in 3 labs. Intra-class correlation coefficient (ICC) 95% confidence interval Lower bound Upper bound WLR BS Single measures .971 .940 .987 Average measures .990 .979 .996 KCL Single measures .976 .949 .989 Average measures .992 .983 .996 COSMAX Single measures .957 .912 .981 Average measures .985 .969 .994 BLR Single measures .961 .921 .983 Average measures .987 .972 .994 J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 7 Fig. 4. Bland-Altman plots for evaluating the BLR between two laboratories. Bland-Altman plots for a) BS and KCL, b) BS and COSMAX, c) KCL-COSMAX. Dotted lines indicate 95% confidence limits. Blue circle represents liquid chemicals and red circle, solids. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 8 66 chemicals with 46 chemicals added to the existing 20 chemicals. 66 chemicals have unequal numbers of test results. To correct this, a weighted approach analysis was also performed for the analysis of predictive capacity. 3. Results 3.1. Within-and between-laboratory reproducibility To investigate whether KeraSkin™ SIT met the criteria of “me-too” test method evaluation stated in the performance standards (PS) for OECD TG 439, 20 reference chemicals were subject to three repeated tests by three laboratories (Table 2 and Fig. 3). Overall decisions were mostly consistent within and between laboratories except for Di-n- propyl disulphide for which Lab 3 incorrectly predicted once. Within- laboratory reproducibility (WLR) and between-laboratory reproduc- ibility (BLR) in cell viability and decisions were also presented with a scatter plot which showed small scatters of viability data for each chemical except for Di-n-propyl disulphide (No. 5). In terms of concor- dance in decision of “non-irritant” or “irritant”, Lab 1 and Lab 2 showed 100% (20/20) and Lab 3, 95% (19/20) WLRs while BLR of three labo- ratories was 100% (20/20). These results met the criteria of WLR ≥ 90% and BLR ≥ 80% of OECD TG 439 PS. ICC was also calculated to evaluate WLR and BLR of cell viability results of KeraSkin™ SIT. Both the single measures and average mea- sures ICC of WLR for BS, KCL and COSMAX, and BLR were all above 0.9 (Table 3), suggesting an excellent level of WLR and BLR of KeraSkin™ SIT. Also, Bland-Altman plot were drawn to evaluate BLR as shown in Fig. 4, which suggests that most of values fell within 95% agreement limit, and the difference between the measured values was close to zero, regardless of the physical states. Interestingly it could be seen that mean CV values aggregated around 0 or 100%, reflecting an excellent iden- tification of irritants from non-irritants. 3.2. Correlation of KeraSkin™ SIT with validated reference methods of OECD TG 439 KeraSkin™ SIT was compared with other existing validation refer- ence methods (VRMs) of OECD TG 439 using ICC, Bland-Altman plot and Kappa statistics. As shown in Table 4, the average ICC between KeraSkin™ SIT and Episkin™ SIT was 0.972 (95% CI: 0.929–0.989), EpiDerm™ SIT, 0.966 (95% CI: 0.914–0.986), and LabCyte EPI- MODEL24 SIT, 0.993 (95% CI: 0.983–0.997), suggesting an excellent correlation. Also, the Bland-Altman plot was distributed close to zero, as shown in Fig. 5. There was no difference between the liquids and the solids but viability values of KeraSkin™ SIT tended to be lower than VRMs. Especially, 4-methyl-thio-benzaldehyde and Di-n-propyl disul- phide were outliers for Episkin™-KeraSkin™, and EpiDerm™- and LabCyte EPI-MODEL24-KeraSkin™ pairs, reflecting that KeraSkin™ SIT may be more sensitive. In the Kappa statistics (Table 5), Episkin™- KeraSkin™ showed 0.608, i.e., substantial concordance while Epi- Derm™- and LabCyte EPI-MODEL24-KeraSkin™ were 0.828, almost perfect concordances. 3.3. Predictive capacity of KeraSkin™ SIT for 20 reference chemicals The predictive capacity of KeraSkin™ SIT, i.e., sensitivity, specificity and accuracy was evaluated with three repeated tests for 20 PS chem- icals by three laboratories (Table 6). With all 180 individual runs, mis- predicted test/total was calculated. The result showed the sensitivity of 98.9%, specificity of 70%, and accuracy of 84.4%. Wilson’s 95% confidence interval of sensitivity was 0.940–0.998, specificity was 0.599–0.785 and accuracy was 0.784–0.890. 3.4. Color interference All three participating laboratories performed the correction pro- cedure for color interference (Fig. 2) for all 20 reference chemicals to pre-check the possibility of direct staining and MTT reactivity. No correction was necessary for the 20 PS chemicals in Lab 1 and Lab 2, but in Lab 3, decision of a chemical, Benzenethiol, 5-(1,1-dimethylethyl)-2- methyl was changed from non-irritant to irritant due to a substantial MTT reactivity as shown in Table 7. 3.5. Predictive capacity of KeraSkin™ SIT for 66 chemicals Additional 46 reference chemicals (Table 8) with in vivo Draize score available were selected and tested to further evaluate the predictive capacity of KeraSkin™ SIT. Overall, data for 66 chemicals (46 + 20 PS chemicals) was obtained (Fig. 6). KeraSkin™ SIT showed a sensitivity of 95.2%, specificity of 82.2% and accuracy of 86.4% for those 66 chem- icals, which met OECD PS criteria. The predictive capacity was further calculated for liquids (43) and solids (23) separately, which also satis- fied the criteria (liquids; sensitivity of 100%, specificity of 73.1% and accuracy of 83.7%, and solids; 75%, 94.7%, 91.3%) (see Table 9). The ROC curve analysis was conducted with viability data for 66 chemicals. As shown in Fig. 7, cut-off value of 60–80% was optimum for solids while 45–75% was for liquids. Overall, optimal cut-off values were determined to be 60–75%. The AUC was estimated to be 0.7235, whichrepresents a reasonable level of predictive capacity (Park et al., 2004; Youngstrom, 2014). Also, the weighted approach analysis was conducted to correct unequal number of test results across chemicals, which showed the sensitivity 94.7%, specificity 82.2%, and accuracy 86.2% (Table 10). 4. Discussion This study was conducted to determine if the KeraSkin™ SIT satisfies the performance standards (PS) of OECD TG 439. The assessment of reproducibility and predictive capacity of the KeraSkin™ SIT test method was performed using 20 PS chemicals and additional 46 chemicals were tested to further assess the predictive capacity. Ker- aSkin™ SIT has achieved a WLR of 95%–100% in all three participating Table 4 Intra-class correlation coefficient of the test results between the KeraSkin™ SIT and validated reference methods. KeraSkin™ SIT Intra-class correlation coefficient (ICC) 95% confidence interval Lower bound Upper bound Episkin™ Single measures .945 .867 .978 Average measures .972 .929 .989 EpiDerm™ SIT Single measures .934 .841 .973 Average measures .966 .914 .986 LabCyte EPI- MODEL24 SIT Single measures .986 .966 .995 Average measures .993 .983 .997 J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 9 Fig. 5. Bland-Altman plots for evaluating the agreement between KeraSkin™ SIT and existing VRMs of OECD TG 439. Bland-Altman plots for a) EpiSkin™-Ker- aSkin™ SIT, b) EpiDerm™ SIT-KeraSkin™ SIT, c) LabCyte EPI-MODEL24 SIT- KeraSkin™ SIT. Dotted lines indicate 95% confidence limits. Blue circle represents liquid chemicals and red circle, solids. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 10 laboratories and a BLR of 100% in the three participating laboratories. Predictive capacity of the KeraSkin™ SIT (sensitivity, specificity and accuracy: 98.9%, 70%, 84.4%) for 20 reference chemicals was also better or equal to the PS criteria (sensitivity, specificity and accuracy: ≥80%, ≥70% and ≥75%). With additional chemicals, predictive ca- pacity was a sensitivity of 95.2%, a specificity of 82.2% and an accuracy of 86.4% for 66 chemicals (20 PS chemicals + 46 additional chemicals). There were 4 false predictions for the non-irritant chemicals. 1- bromo-4-chloroethylene (no. 11), 4-methyl-thio-benzaldehyde (no. 16), cinnamaldehyde (no. 20) were incorrectly predicted in all three laboratory. This is considered to be due to the characteristic of the materials or limitation of RhE SITs since other VRMs like EpiDerm™ SIT and LabCyte EPI-MODEL24 SIT methods also predicted them incorrectly (Kandárová et al., 2009; Katoh and Hata, 2011; Spielmann et al., 2007). Interestingly, KeraSkin™ SIT can predict the irritant, Di-n-propyl disulphide, correctly except for one run by Lab 3. Di-n-propyl disul- phide, along with 1-decanol, is Category 2 chemical and it was frequently predicted as a false negative in VRMs, suggesting that Ker- aSkin™ SIT may be more sensitive than existing VRMs. By correctly identifying Di-n-propyl disulphide, the sensitivity of KeraSkin™ SIT was higher than VRMs for 20 reference chemicals, exceeding the sensitivity Table 5 Cohen’s Kappa statistics of the results between the KeraSkin™ SIT and the validated reference methods in OECD TG 439. comparison Kappa coefficient P- value EpiSkin™ SIT .608 .007 EpiDerm™ SIT .828 .000 LabCyte EPIMODEL24 SIT .828 .000 Table 6 Predictive capacity of KeraSkin™ SIT for 20 reference chemicals in OECD TG 439 PS. PS criteria Predictive capacity BS (60) KCL (60) COSMAX (60) Total(180) Sensitivity ≥80% 100% 100% 96.7% 98.9% Specificity ≥70% 70% 70% 70% 70% Accuracy ≥75% 85% 85% 83.3% 84.4% Lab1; Biosolution Co., Lab2; Korea Conformity Laboratories, Lab3; COSMAX. Table 7 Color Interference check for 20 PS reference chemicals in COSMAX. J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 11 criteria of PS, which was confirmed again with 66 chemicals (sensitivity of 95.2%). Comparison of KeraSkin™ SIT with VRMs of OECD TG 439 revealed that KeraSkin™ SIT agrees well with VRMs with respect to irritant de- cisions and raw data (viability). Various statistical analyses were conducted, such as ICC, Bland-Altman plot and Kappa statistics. Pair- wise comparison with all VRMs using these statistical methods all sug- gests a moderate or high correlation between KeraSkin™ SIT and existing VRMs. Interestingly, KeraSkin™ SIT was more similar to Epi- Derm™ and LabCyte EPI-MODEL24 than Episkin™ as determined by Table 8 List of 46 substances for further evaluation. No Chemicals CASN In vivo Class (In vivo Cat.) Physical Status Storage condition Manufacturer Cat No. Purity (%) 1 Phenylethyl alcohol (2-Phenylethanol) 60-12-8 NI (NC) liquid RT Sigma 77861 99.7 2 Isopropyl myristate 110-27-0 NI (NC) liquid RT Sigma 172472 98.0 3 Triethylene glycol 112-27-6 NI (NC) liquid RT Sigma T59455 99.3 4 4,4′-Methylenebis (2,6-di-tert-butylphenol) 118-82-1 NI (NC) solid RT Sigma 277924 99.6 5 Benzyl benzoate 120-51-4 NI (NC) liquid RT Sigma B6630 99.8 6 3-Chloronitrobenzene (1-Chloro-3-nitrobenzene) 121-73-3 NI (NC) solid RT Sigma 218758 99.8 7 Isopropyl palmitate 142-91-6 NI (NC) liquid RT Sigma W515604 99.9 8 Dodecanoic acid (Lauric acid) 143-07-7 NI (NC) solid RT Sigma W261408 98.0 9 Sodium bicarbonate 144-55-8 NI (NC) solid RT Sigma S5761 100.0 10 4-Amino-1,2,4-triazole (4-Amino-4H-1,2,4-triazole) 584-13-4 NI (NC) solid RT Sigma A81803 99.8 11 3,3′-Dithiodipropionic acid 1119-62-6 NI (NC) solid RT Sigma 109010 99.8 12 Silane A-1430 ((3-Chloro propyl) trimethoxy silane) 2530-87-2 NI (NC) liquid RT Sigma 440183 97.5 13 2,5-Dimethyl-4-oxo-4,5-dihydrofuran-3-yl acetate (4- Acetoxy-2,5-dimethyl-3(2H)-furanone) 4166-20-5 NI (NC) liquid RT TCI A2192 98.1 14 Di-propylene glycol 25265-71- 8 NI (NC) liquid RT Sigma D215554 99.0 15 Polyethylene glycol 400 25322-68- 3 NI (NC) liquid RT TCI N0443 – 16 Dipropylene glycol monobutyl ether (DPnB) (Di(propylene glycol) butyl ether, mixture of isomers) 29911-28- 2 NI (NC) liquid RT Sigma 484237 99.1 17 Propylene glycol 57-55-6 NI (NC) liquid RT Sigma P4347 100.0 18 1-(4-Chlorophenyl)-3-(3,4-dichlorophenyl)urea 101-20-2 NI (NC) solid RT Sigma 105937 98.9 19 Erucamide 112-84-5 NI (NC) solid − 18 ◦C Sigma 280577 88.4 20 Benzyl salicylate 118-58-1 NI (NC) liquid RT Sigma 84260 99.9 21 3.3-Dimethylpentane 562-49-2 NI (NC) liquid RT ACROS 152660050 99.6 22 Tetrabromophenol blue 4430-25-5 NI (NC) solid RT Sigma 199311 80.0 23 2-Ethylhexyl 4-methoxycinnamate 5466-77-3 NI (NC) liquid RT Sigma 78848 98.9 24 Sodium bisulphite 7631-90-5 NI (NC) solid RT Sigma 243973 – 25 2-(Formylamino)-3-thiophenecarboxylic acid 43028-69- 9 NI (NC) solid RT Sigma S935344 100.0 26 Methyl laurate (at 37 ◦C, liquid) 111-82-0 NI (Cat 3) liquid RT Sigma 234591 99.9 27 Benzophenone-3 131-57-7 NI (NC) solid RT Sigma H36206 99.5 28 3-Chloro-4-fluoronitrobenzene (below 25 ◦C, solid) 350-30-1 NI (NC) solid RT Sigma 233234 99.2 29 Cyclohexadecanone (below 37 ◦C, solid) 2550-52-9 NI (NC) solid RT Symrise 600359 – 30 2-Phenylhexanenitrile3508-98-3 NI (Cat 3) liquid RT IFF 190979 – 31 2-Ethylhexyl-4-aminobenzoate 26218-04- 2 NI (NC) solid 4 ◦C Santacruz SC-496103 96.0 32 Capryl-isostearate 209802- 43-7 NI (NC) liquid RT Nikko chemical S028795 100.0 33 Diisopropyl sebacate 7491-02- 03 NI (NC) liquid RT Nippon Fine Chemicals – 100.0 34 Barium sulfate 7727-43-7 NI (NC) solid RT/ hygroscopic ACROS 222515000 99.2 35 10% Xanthan gum (in D.W.) 11138-66- 2 NI (NC) gel RT/ hygroscopic Sigma 43708 – 36 1-Bromopentane 110-53-2 I (Cat 2) liquid RT Sigma 117811 99.5 37 Nonanoic acid (at 37 ◦C, liquid) 112-05-0 I (Cat 2) liquid 2–8 ◦C Sigma N5502 97.0 38 Decanoic acid (at 37 ◦C, liquid) 334-48-5 I (Cat 2) liquid RT Sigma W236403 99.3 39 Butyl methacrylate 97-88-1 I (Cat 2) liquid RT Sigma 235865 99.4 40 alpha-Terpineol 98-55-5 I (Cat 2) liquid RT Sigma W304506 97.9 41 Butyric acid 107-92-6 I (Cat 1B) liquid RT Sigma B103500 99.8 42 Heptanoic acid 111-14-8 I (R34) liquid RT Sigma 75190 99.9 43 Octanoic acid (Caprylic acid) 124-07-2 I (Cat 1B/1C) liquid RT Sigma C2875 99.9 44 25% Cetrimonium chloride (in D.W.) 112-02-7 I (Cat 2) liquid RT Sigma 292737 – 45 Ferric chloride 7705-08-0 I (R34) solid RT Sigma 157740 97.0 46 Piroctone olamine 68890-66- 4 I (Cat 2) solid 4 ◦C TCI P2178 99.8 J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 12 ICC and Kappa statistics. More importantly, Bland-Altman plot revealed that KeraSkin™ SIT appears to be more sensitive overall than existing VRMs, which can explain the higher sensitivity than VRMs. These points could not be seen with a simple point estimation of predictive capacity, reflecting the utility of statistical methods for evaluating and comparing me-too tests with existing gold standards as also exemplified by the recent studies (Alépée et al., 2016; Lim et al., 2019a). Color interference is a potential issue for all colorimetric assay-based in vitro tests (Joo et al., 2019). We also confirmed that decision can be changed for a non-irritant with color interference after correction. However, the correction for color interference was seldom necessary since color-interfering chemicals persistent through vigorous washing steps are not common and irritant decisions are not affected by correction. One run with one chemical needed correction among 180 runs in the main study. And among 46 chemicals tested for additional predictive capacity assessment, 10 chemicals showed concerns of color interference but correction did not change the decision (data not shown). Against this background, we suggest that correction for color interference may be reserved only for non-irritants as a post-check rather than doing pre-check for all chemicals as shown in Fig. 2. This will significantly save costly RhE tissues for color checking. Collectively, we demonstrated that KeraSkin™ SIT showed good reproducibility and predictive capacity, meeting the criteria of OECD TG 439 PS. The results of the validation study which was conducted in accordance of OECD TG 439 PS, support that KeraSkin™ SIT is compatible with other existing VRMs of OECD TG 439 in terms of reproducibility, predictive capacity and applicability domain. There- fore, we believe that KeraSkin™ SIT may be considered as a me-too method similar to RhE SIT VRMs of OECD TG 439. Funding sources This research was supported by grants from the Ministry of Food and Drug Safety in 2018 (18182MFDS463) and National Research Founda- tion (NRF) funded by Ministry of Science and ICT (MSIT) (2018R1A5A2025286). Fig. 6. KeraSkin™ SIT viability data for 66 chemicals composed of 23 solids and 43 liquids. Red color indicates falsely predicted substances. Dotted line indicates cut-off (50%) and blue line represents 60%, an optimum cut-off suggested by ROC analysis. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) Table 9 Predictive capacity of KeraSkin™ SIT for 66 chemicals. Total (66) Liquid (43) Solid (23) PS I NI I NI I NI I 20 1 17 0 3 1 NI 8 37 7 19 1 18 Total 66 43 23 Sensitivity 95.2% 100% 75% 80% Specificity 82.2% 73.1% 94.7% 70% Accuracy 86.4% 83.7% 91.3% 75% J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 13 Fig. 7. ROC curve for 66 substances and 23 solids and 43 liquids. The receiver operating characteristic (ROC) curve was drawn with cell viability data obtained with KeraSkin™ SIT. Grey color is the current cut-off, 50% and yellow color is the optimum cut-off value suggested from the analysis. (For interpretation of the references to color in this figure legend, the reader is referred to the Web version of this article.) Table 10 Comparison of weighted and unweighted predictive capacity of KeraSkin™ SIT. Predictive capacity Unweighted PS criteria Weighted Total (66) Liquid (43) Solid (23) Total (66) Liquid (43) Solid (23) Sensitivity 95.2% 100% 75% ≥80% 94.7% 99.3% 75% Specificity 82.2% 73.1% 94.7% ≥70% 82.2% 73.1% 94.7% Accuracy 86.4% 83.7% 91.3% ≥75% 86.2% 83.5% 91.3% J. Han et al. Regulatory Toxicology and Pharmacology 117 (2020) 104725 14 Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Acknowledgements This research was supported by grants from the Ministry of Food and Drug Safety in 2018 (18182MFDS463) and National Research Founda- tion (NRF) funded by Ministry of Science and ICT (MSIT) (2018R1A5A2025286). References Adler, S., et al., 2011. Alternative (non-animal) methods for cosmetics testing: current status and future prospects—2010. Arch. Toxicol. 85, 367–485. Akbarsha, M.A., Mascarenhas, B., 2019. Cosmetic Regulation and Alternatives to Animal Experimentation in India. Alternatives to Animal Testing. Springer, Singapore, pp. 57–62. Alépée, N., et al., 2016. Multi-laboratory evaluation of SkinEthic HCE test method for testing serious eye damage/eye irritation using solid chemicals and overall performance of the test method with regard to solid and liquid chemicals testing. Toxicol. Vitro 34, 55–70. Atkinson, G., 1998. Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med. 26, 217–238. Bartko, J.J., 1976. On various intraclass correlation reliability coefficients. Psychol. Bull. 83, 762. Berry, K.J., et al., 1988. A generalization of Cohen’s kappa agreement measure to interval measurement and multiple raters. Educ. Psychol. Meas. 48, 921–933. Blackman, N.J.M., 2000. Interval estimation for Cohen’s kappa as a measure of agreement. Stat. Med. 19, 723–741. Choi, J., et al., 2014. Skin corrosion and irritation test of sunscreen nanoparticles using reconstructed 3D human skin model. Environ. Toxicol. 29. Cooper 2nd, J., Saracci, R., 1979. Describing the validity of carcinogen screening tests. Br. J. Cancer. 39, 87. Giavarina, D., 2015. Understanding bland altman analysis. Biochem. Med. 25, 141–151. Hirschmann, M., et al., 2011. The position and orientation of total knee replacement components: a comparison of conventional radiographs, transverse 2D-CT slices and3D-CT reconstruction. J. Bone Joint Surg. 93, 629–633. Hwang, J.-h., et al., 2018. Investigation of dermal toxicity of ionic liquids in monolayer- cultured skin cells and 3D reconstructed human skin models. Toxicol. Vitro 46, 194–202. Joo, K.M., et al., 2019. Development and validation of UPLC method for WST-1 cell viability assay and its application to MCTT HCE eye irritation test for colorful substances. Toxicol. Vitro 60, 412–419. Jung, K.-M., et al., 2014. KeraSkin™-VM: a novel reconstructed human epidermis model for skin irritation tests. Toxicol. Vitro 28, 742–750. Kandárová, H., et al., 2009. In vitro skin irritation testing: improving the sensitivity of the EpiDerm skin irritation test protocol. Altern. Lab. Anim. 37, 671–689. Katoh, M., Hata, K., 2011. Refinement of LabCyte EPI-MODEL24 skin Irritation test method for adaptation to the requirements of OECD test guideline 439. Altern. Anim. Test. Exp. 16, 111–122. Kim, K., et al., 2017. Anti-pigmentary effect of (-)-4-Hydroxysattabacin from the marine- derived Bacterium Bacillus sp. Mar. Drugs 15, 138. Koo, T.K., Li, M.Y., 2016. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med. 15, 155–163. Kose, O., et al., 2018. Evaluation of skin irritation potentials of different cosmetic products in Turkish market by reconstructed human epidermis model. Regul. Toxicol. Pharmacol. 98, 268–273. Landis, J.R., Koch, G.G., 1977. The measurement of observer agreement for categorical data. Biometrics 33, 159–174. Lee, M., et al., 2017. Alternatives to in vivo draize rabbit eye and skin irritation tests with a focus on 3D reconstructed human cornea-like epithelium and epidermis models. Toxicol. Res. 33, 191–203. Lee, S.-H., et al., 2016. Evaluating the micronucleus induction potential for the genotoxicity assay using the human skin model, KeraSkin TM. J Soc. Cosmet. Sci. Korea 42, 211–216. Lim, S.-E., et al., 2019a. Statistical analysis of the reproducibility and predictive capacity of MCTT HCE™ eye irritation test, a me-too test method for OECD TG 492. Regul. Toxicol. Pharmacol. 107, 104430. Lim, S.E., et al., 2019b. Me-too validation study for in vitro eye irritation test with 3D- reconstructed human cornea epithelium, MCTT HCE(TM). Toxicol. Vitro 55, 173–184. OECD, 2002. Test No. 404 : Acute Dermal Irritation/Corrosion. OECD Publishing, paris, p. 13. OECD, 2010. Test no. 439: in vitro skin irritation. In: OECD. OECD Publishing, Paris. OECD, 2015a. In Vitro Skin Irritation: Reconstructed Human Epidermis Test Method, vol. 439. OECD Publishing, paris. OECD, 2015b. No. 220 performance standards for the assessment of proposed similar or modified in vitro reconstructed human epidermis (RhE) test methods for skin irritation testing as described in TG 439. In: OECD. OECD Publishing, Paris. OECD, 2017. Guidance document on an integrated approach on testing and assessment (IATA) for skin corrosion and irritation. In: OECD. OECD Publishing, Paris. OECD, 2019a. Test No. 439: In Vitro Skin Irritation: Reconstructed Human Epidermis Test Method. OECD Publishing, Paris. OECD, 2019b. Reports of the Peer Reviews of the epiCS and Skin+ Test Methods in View of Their Inclusion in Test Guideline 439 on In Vitro Skin Irritation. Parish, S.T., et al., 2020. An evaluation framework for new approach methodologies (NAMs) for human health safety assessment. Regul. Toxicol. Pharmacol. 112, 104592. Park, J., et al., 2018. Mixture toxicity of methylisothiazolinone and propylene glycol at a maximum concentration for personal care products. Toxicol. Res. 34, 355–361. Park, S.H., et al., 2004. Receiver operating characteristic (ROC) curve: practical review for radiologists. Korean J. Radiol. 5, 11–18. Shrout, P.E., Fleiss, J.L., 1979. Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86, 420. Spielmann, H., et al., 2007. The ECVAM international validation study on in vitro tests for acute skin irritation: report on the validity of the EPISKIN and EpiDerm assays and on the Skin Integrity Function Test. Altern. Lab. Anim. 35, 559–601. Szklo, M., Nieto, F.J., 2014. Epidemiology: Beyond the Basics. Jones & Bartlett Publishers. Yang, H., et al., 2017. Prevalidation trial for a novel in vitro eye irritation test using the reconstructed human cornea-like epithelial model, MCTT HCE™. Toxicol. Vitro. 39, 58–67. Youngstrom, E.A., 2014. A primer on receiver operating characteristic analysis and diagnostic efficiency statistics for pediatric psychology: we are ready to ROC. J. Pediatr. Psychol. 39, 204–221. J. Han et al. http://refhub.elsevier.com/S0273-2300(20)30151-3/sref1 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref1 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref2 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref3 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref4 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref4 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref5 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref5 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref6 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref6 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref7 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref7 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref8 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref8 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref9 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref9 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref10 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref11 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref12 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref13 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref14 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref14 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref15 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref15 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref16 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref17 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref17 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref18 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref18 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref19 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref20 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref20 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref21 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref22 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref23 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref24 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref25 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref25 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref26 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref27 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref27 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref28 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref29 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref29 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref30 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref30 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref31 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref31 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref32 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref33 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref33 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref34 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref34 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref35 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref35 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref36 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref37 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref37 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref38 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39 http://refhub.elsevier.com/S0273-2300(20)30151-3/sref39 Me-too validation study for in vitro skin irritation test with a reconstructed human epidermis model, KeraSkin™ for OECD te ... 1 Introduction 2 Material and methods 2.1 Chemicals and reagents 2.2 KeraSkin™ SIT protocol 1.5 2.2.1 Washing 2.2.2 MTT assay 2.2.3 Detecting and correcting interference from colored chemicals and MTT reducers 2.2.4 Prediction model for the KeraSkin™ SIT 2.2.5 Acceptance criteria 2.3 Statistical analysis for the reliability of the KeraSkin™ SIT 2.3.1 Data collection and chart 2.3.2 Within-laboratory and between-laboratory reproducibility 2.3.3 Correlation with validated reference methods 2.4 Predictive capacity of the KeraSkin™ SIT 2.4.1 20 chemicals and 46 additional chemicals for the evaluation of predictive capacity 3 Results 3.1 Within-and between-laboratory reproducibility 3.2 Correlation of KeraSkin™ SIT with validated reference methods of OECD TG 439 3.3 Predictive capacity of KeraSkin™ SIT for 20 reference chemicals 3.4 Color interference 3.5 Predictive capacity of KeraSkin™ SIT for 66 chemicals 4 Discussion Funding sources Declaration of competing interest Acknowledgements References
Compartilhar