Introduction

Gastric cancer remains one of the most common malignancies and leading causes of cancer death worldwide [1]. Although the most effective treatment for localized disease is surgery, approximately half of all patients with advanced-stage disease experience recurrence following curative resection. The prognosis of patients with advanced or recurrent gastric cancer (AGC) remains poor, with commonly used combination chemotherapy regimens, consisting of a fluoropyrimidine plus a platinum agent with or without docetaxel or anthracyclines, leading to a median survival of only 1 year [28]. Therefore, the development of novel anticancer agents or strategies for the treatment for AGC is urgently required; however, for the evaluation of such agents and treatments, it is critical to conduct effective randomized trials.

Reflecting the relatively high incidence of gastric cancer worldwide, numerous clinical trials have been conducted in multiple countries or as part of global studies [7, 8]. These clinical trials have displayed surprising heterogeneity in overall survival (OS) even if patients with similar stages of unresectable AGC are targeted. Although several identified prognostic factors in patient characteristics and practice patterns, including surgery and chemotherapy, are thought to partially contribute to the observed heterogeneity [9], the exact reason for this heterogeneity is unknown.

A number of reports have evaluated prognostic factors in AGC patients who underwent chemotherapy [1014]. For example, the recent Global Advanced/Adjuvant Stomach Tumor Research through International Collaboration (GASTRIC) project confirmed the impact of performance status (PS), disease status (metastatic vs. locally recurrence vs. locally advanced), number of metastatic organs, location of metastasis, and prior surgery on the survival of AGC based on individual patient data analysis of previous randomized studies [10]. In addition, Chau et al. [11] identified four independent prognostic factors for poor AGC survival: PS ≥ 2, liver metastasis, peritoneal metastasis, and increased serum alkaline phosphatase (ALP) levels, which were subsequently used to classify patients into three risk groups (Royal Marsden hospital prognostic index) that were validated in a large phase III trial [12]. The prognostic factors for AGC identified to date also serve as important stratification factors in randomized trials to exclude possible confounding variables. To our knowledge, however, there is no consensus as to the specific patient characteristics that are most suitable to report or to be used as stratification factors in clinical trials for AGC.

Here, we report the results of a comprehensive review of published randomized trials for AGC that we conducted to investigate the patient characteristics and stratification factors that have been evaluated and reported. We also examined differences in previous studies according to trial period and region.

Materials and methods

Search for studies

We conducted a literature search for randomized clinical trials of AGC through computer-based searches of the Medline database (January 1966 and December 2010) and searches of abstracts from conference proceedings of the American Society of Clinical Oncology (1995–2010), and the European Cancer Conference and European Society for Medical Oncology (1995–2010). Search key words included: “gastric cancer,” “randomized”, “advanced or metastatic”, and “chemotherapy.” The search was also guided by a thorough examination of reference lists from original and review articles.

Procedures

Two investigators (Kohei Shitara and Keitaro Matsuo) extracted data in accordance with the Quality of Reporting of Meta-analyses (QUORUM) guidelines [15]. Randomized trials of systemic chemotherapy for patients with histologically confirmed AGC (metastatic or unresectable locally advanced disease) of the stomach or gastroesophageal junction were included in the analyses. Trials that compared chemotherapy with best supportive care were also included, as were those which included patients with adenocarcinoma of the distal esophagus. Exclusion criteria included trials designed to assess combined modality treatments, including radiotherapy and surgery (neoadjuvant or adjuvant chemotherapies); and those in which patients were pretreated with systemic chemotherapy. Unpublished trials and trials published in non-English languages were also excluded from this analysis.

For each trial, the reporting of patient characteristics and stratification factors was extracted. As trial characteristics, the following information was extracted: first author’s name, year of publication, trial design (randomized phase II or III, if reported), trial location, number of enrolled patients, and treatment regimens. As patient characteristics, the following information was extracted (if reported): age; gender; PS; histology (e.g., diffuse or intestinal type); disease status (e.g., advanced or recurrent disease); primary tumor location (e.g., stomach or gastroesophageal junction); extension of disease (e.g., locally advanced or metastatic); previous gastrectomy, adjuvant chemotherapy, and radiotherapy; sites of metastases (e.g., peritoneum, liver, and lymph node); number of metastatic organs; and proportion of patients with measurable disease. The proportion of patients who received second-line chemotherapy was also extracted. All data were checked for internal consistency.

Statistical methods

Differences in the reporting of patient characteristics according to trial period (before vs. after 2004) and trial region (Asian vs. non-Asian trials) were assessed by the χ2 test or Fisher’s exact test, as appropriate. Because there was no definitive cut-off time for performing trend analysis, we divided the period at 2004 as this led to the number of trials (36 vs. 31 trials) and number of patients being almost equally distributed in the two periods. Median values for patient characteristics were calculated for each trial and the combined patient population. Differences in patient characteristics according to region or trial period were evaluated using the Mann–Whitney test. Use of stratification factors according to trial period or region was evaluated with the χ2 test or Fisher’s exact test as appropriate. Statistical analyses were performed using STATA ver. 10 (StataCorp. LP; College Station, TX, USA). All tests were two-sided, and P values of less than 0.05 were considered statistically significant.

Results

Study selection

Our extensive literature search yielded a total of 852 potentially relevant reports, of which 743 were initially excluded on examination of titles (Fig. 1). After review of the abstracts of the remaining studies, 67 randomized trials, with a total of 153 treatment arms and 12,656 patients were identified as eligible for analysis (Supplement 1). Table 1 summarizes the characteristics of the 67 selected clinical trials, which consisted of 23 and 30 randomized phase II and III trials, respectively, and 14 trials that did not report the trial phase.

Fig. 1
figure 1

Selection process for trials. An initial literature search for randomized clinical trials of advanced gastric cancer (AGC) identified a total of 852 potentially relevant reports, of which 743 were excluded on examination of titles. After review of the abstracts of the remaining studies, 67 randomized trials, with a total of 153 treatment arms and 12,656 patients were identified as eligible for analysis. ASCO American Society of Clinical Oncology, ESMO European Society for Medical Oncology

Table 1 Characteristics of the 67 clinical trials analyzed in the present study

Patient characteristics reported in trials

Table 2 summarizes the patient characteristics reported in the 67 clinical trials included in the analysis. Two global studies that included Asian countries were excluded when comparing trials in Asia and non-Asian countries.

Table 2 Reported patient characteristics in the 67 clinical trials analyzed in the present study

Age, gender, and PS

All 67 clinical trials provided information of patient age, with nearly all (94%) providing a median value, and four trials providing categorized values. One trial targeted elderly patients (>70 years). Gender information was reported by all but one trial. Sixty-four trials (96%) provided information regarding PS, with 46 reporting Eastern Cooperative Oncology Group (ECOG)/WHO PS classifications and the other 17 using the Karnofsky Performance Scale (KPS). Considerable PS variability was detected among the trial patients, as follows: PS 0–1, 4 trials; PS 0–2, 25 trials; and PS 0–3, 17 trials; and KPS 100–80, 1 trial; KPS 100–70, 5 trials; KPS 100–60, 7 trials; and KPS 100–50, 4 trials. Among the trials that used ECOG PS, 22 reported ECOG PS 0 versus 1 versus 2, whereas the other studies reported PS 0 and 1 without discrimination. No significant differences in reporting were detected in the trial period or region for PS, age, and gender.

Disease characteristics

The proportion of patients with measurable disease was reported in 69% of trials, with half including only patients with at least one measurable disease. Extension of disease and disease status were reported in 57 and 27% of trials, respectively. The location of metastases was reported in 64% of trials; the liver was the most commonly reported site, followed by the peritoneum. Histology and the number of metastatic organs were not reported in more than half of the trials. The Lauren classification (intestinal or diffuse type) was used in 12 trials, while classifications such as the American Joint Committee on Cancer grading system (well- or poorly differentiated adenocarcinoma, etc.) were used in 18 trials. The location of primary tumors was reported in 26 trials (39%), with 17 trials including not only gastric cancer, but also esophagogastric junction or esophageal cancer. The frequency of reporting these characteristics appeared to be increasing in more recent trials, although most examined characteristics were reported in less than 60% of the trials (Table 2). Only primary tumor location was more frequently reported in non-Asian than Asian trials, and no other significant differences in reporting of disease characteristics were observed based on trial area.

The other reported patient characteristics were as follows: weight loss (n = 12; 18%); any symptoms (anorexia, dysphasia, etc., n = 7; 10%); body surface area (n = 3; 4%); ethnic groups (n = 2; 3%); hemoglobin level (n = 4; 6%); serum ALP level (n = 3; 4%); comorbidities (n = 3; 4%), and Royal Marsden hospital prognostic index (n = 1; 1%).

Previous treatment and second-line chemotherapy

An indication of the proportion of patients with previous gastrectomy was reported in 69% of trials, with the curability of gastrectomy (curative or palliative with residual disease) specified in approximately 50% of trials. Previous adjuvant chemotherapy and radiotherapy were infrequently reported (24 and 16% of trials, respectively). Second-line chemotherapy was also reported with low frequency (27% of trials), and was typically indicated in the text, rather than being included in patient characteristic tables. The reporting of previous treatment and second-line chemotherapies was found to be increasing in recent trials, although more than half did not include information related to second-line chemotherapy. In addition, Asian trials more commonly reported the use of adjuvant chemotherapy than non-Asian trials.

Patient characteristics of the combined trial population

The characteristics of the 12,656 AGC patients were calculated based on the reported values in each of the 67 clinical trials (Table 3). Recent trials included more patients with better PS (ECOG PS 0–1; 94 vs. 64%; P < 0.01) and less locally advanced disease (4 vs. 27%) than older trials. Asian trials included more patients with diffuse histology than non-Asian trials (53 vs. 34%; P < 0.01), while patients with liver metastasis (43 vs. 31%; P = 0.01) or locally advanced disease (15 vs. 3%; P = 0.04) were more common in non-Asian trials. Second-line chemotherapy was more commonly used in Asian and recent trials.

Table 3 Patient characteristics (n = 12,656) in AGC trials included in this analysis

Stratification factors

Among the 67 trials, 40 (60%) used stratification factors (Table 4). The median number of factors was 3, with an observed range of 1–5. The most common stratification factor was PS, followed by institution and previous gastrectomy. More recent trials used one or more stratification factors than older trials (47 vs. 75%, P = 0.03, Table 4). Gender was more commonly used in older trials (14 vs. 0%). No significant difference of stratification factors was observed between Asian and non-Asian trials, other than the frequency of use of disease status (0 vs. 14%).

Table 4 Stratification factors in the 67 clinical trials analyzed in the present study

Discussion

To our knowledge, this represents the first study to review the reporting of patient characteristics in published randomized trials for AGC. Our results showed considerable inconsistency in the reporting of patient characteristics and the use of stratification factors in clinical trials for AGC. A similar finding was reported by Sorbye et al. [16], who analyzed metastatic colorectal cancer (MCRC) clinical trials and advocated that an urgent need exists for an international consensus on the reporting of patient characteristics and stratification in MCRC trials. Our data also revealed several differences in patient characteristics between trials conducted before and after 2004, and between Asian and non-Asian trials. It is possible that these differences may have contributed to the observed heterogeneity in the survival outcomes of each trial.

Several prognostic factors have been identified for patients with AGC who have undergone chemotherapy [1014]. As described in the “Introduction”, the GASTRIC project confirmed the impact of ECOG PS, disease status, number of metastatic organs, location of metastasis, and prior surgery on the survival of AGC patients, as determined by individual patient data analysis of previous randomized studies [10]. Notably, this project, which may have included the largest AGC patient set to date, identified that PS1 and PS2 were significantly associated with poor survival, with hazard ratios (HRs) of death of 1.36 and 2.17, respectively [10]. In the GASTRIC analysis, although most trials included PS among the reported patient characteristics, a number of studies classified PS0 and PS1 separately, and several studies used KPS rather than the ECOG scale. In addition, local recurrence and metastatic disease were reported to be associated with worse outcomes than locally advanced disease [10]. In our present analysis, approximately 50% of trials reported disease extension (locally advanced or metastatic disease), and only 30% of trials indicated disease status (advanced or recurrent disease).

Although the GASTRIC analysis did not evaluate the importance of specific metastatic organs on outcomes, another large prognostic analysis, by Chau et al. [11, 12], reported the impact of liver and peritoneal metastasis on AGC patient survival. Affected metastatic organs were reported in 64% of the trials in our analysis, but the number of metastatic organs, which has significant impact on survival according to the GASTRIC analysis, was only reported with a frequency of 39%. Although histology was not identified as prognostic in the GASTRIC analysis, several recent trials suggest that an interaction exists between histology and drug response [6, 7, 17, 18]. For example, a subset analysis of the First-line Advanced Gastric Cancer Study (FLAGS) trial has indicated that the oral fluoropyrimidine S-1 appears to be superior to fluorouracil in the treatment of diffuse-type gastric cancer [6]. This finding is consistent with the results of a subset analysis of the Japan Clinical Oncology Group (JCOG) 9912 study that also indicated S-1 is better than fluorouracil in patients with diffuse-type AGC or gastric cancer associated with high dihydropyrimidine dehydrogenase (DPD) activity, which is more commonly associated with diffuse-type than intestinal-type tumors [17]. This result was not unexpected, because S-1 is a potent competitive inhibitor of DPD. In contrast to DPD, human epidermal growth factor receptor 2 (HER2)-positive AGC, for which the anti-HER2 agent trastuzumab is effective [7], is reported to be higher among intestinal-type tumors [18]. The prognostic factors and tumor characteristics identified in these studies should be reported in all clinical trials of AGC, as they are necessary to adequately interpret trial data and treatment outcomes.

Our analysis also revealed that the types of second-line chemotherapy and proportions of patients who received such treatment were not routinely reported in AGC trials. As several recent reports have suggested that second-line chemotherapy has a significant impact on OS [1921], we propose that second-line therapies should be diligently reported in future clinical trials of first-line AGC treatment, because second-line chemotherapy might influence the OS as the primary endpoint, as suggested by our previous analysis [22].

Additionally, the numerous prognostic factors identified for AGC may be important for the stratification of patients with respect to risk and treatment arms in randomized trials. To adequately analyze treatment effects on clinical outcomes, efforts should be undertaken to maximally decrease imbalance of prognostic factors between treatment arms in a clinical trial [23]. Although there is no definite consensus on the optimal method for stratification, stratification is recommended for superiority trials with fewer than 400 patients [24] and for non-inferiority trials with any number of patients [25]. In our analysis, stratification was conducted in only 60% of the examined trials, and was performed with quite variable stratifying factors. Based only on the present analysis, it is difficult to suggest a standardization approach for stratification factors in AGC trials, and further analysis and discussion are necessary.

In recent years, a trend of increased median OS in AGC patients has been observed concurrent with the development of new chemotherapeutic agents [2, 4, 7, 26]. It is also possible that second-line chemotherapy may have contributed to the improvement in OS [1921]; however, our crude comparison of trials conducted prior to and after 2004 also showed significant differences in PS and disease extension. These differences may have also contributed to the improved survival reported in more recent trials, as well as survival differences between Asian and non-Asian trials. The exact impact of chemotherapy and patient characteristics on survival would be best addressed in well-designed randomized studies and meta-analyses of individual patient data.

In conclusion, our analyses of published clinical trials for AGC revealed inconsistencies in the reporting of patient characteristics and use of stratification factors. An international consensus on the reported characteristics and stratification in AGC trials is necessary to improve the analysis of future clinical trials.