09/03/2024


Recently, several anti-inflammatory peptides (AIPs) have been found in the process of the inflammatory response, and these peptides have been used to treat some inflammatory and autoimmune diseases. Therefore, identifying AIPs accurately from a given amino acid sequences is critical for the discovery of novel and efficient anti-inflammatory peptide-based therapeutics and the acceleration of their application in therapy. In this paper, a random forest-based model called iAIPs for identifying AIPs is proposed. First, the original samples were encoded with three feature extraction methods, including g-gap dipeptide composition (GDC), dipeptide deviation from the expected mean (DDE), and amino acid composition (AAC). Second, the optimal feature subset is generated by a two-step feature selection method, in which the feature is ranked by the analysis of variance (ANOVA) method, and the optimal feature subset is generated by the incremental feature selection strategy. Finally, the optimal feature subset is inputted into the random forest classifier, and the identification model is constructed. Experiment results showed that iAIPs achieved an AUC value of 0.822 on an independent test dataset, which indicated that our proposed model has better performance than the existing methods. Furthermore, the extraction of features for peptide sequences provides the basis for evolutionary analysis. The study of peptide identification is helpful to understand the diversity of species and analyze the evolutionary history of species.Background Non-small cell lung cancer (NSCLC) is among the major health problems around the world. Reliable biomarkers for NSCLC are still needed in clinical practice. We aimed to develop a novel ferroptosis- and immune-based index for NSCLC. Methods The training and testing datasets were obtained from TCGA and GEO databases, respectively. Immune- and ferroptosis-related genes were identified and used to establish a prognostic model. Then, the prognostic and therapeutic potential of the established index was evaluated. Results Intimate interaction of immune genes with ferroptosis genes was observed. A total of 32 prognosis-related signatures were selected to develop a predictive model for NSCLC using LASSO Cox regression. Patients were classified into the high- and low-risk group based on the risk score. Patients in the low-risk group have better OS in contrast with that in the high-risk group in independent verification datasets. Besides, patients with a high risk score have shorter OS in all subgroups (T, N, and M0 subgroups) and pathological stages (stage I, II, and III). The risk score was positively associated with Immune Score, Stromal Score, and Ferroptosis Score in TCGA and GEO cohorts. A differential immune cell infiltration between the high-risk and the low-risk groups was also observed. Finally, we explored the significance of our model in tumor-related pathways, and different enrichment levels in the therapeutic pathway were observed between the high- and low-risk groups. Conclusion The present study developed an immune and ferroptosis-combined index for the prognosis of NSCLC.Predicting the protein sequence information of enzymes and non-enzymes is an important but a very challenging task. Existing methods use protein geometric structures only or protein sequences alone to predict enzymatic functions. Thus, their prediction results are unsatisfactory. In this paper, we propose a novel approach for predicting the amino acid sequences of enzymes and non-enzymes via Convolutional Neural Network (CNN). In CNN, the roles of enzymes are predicted from multiple sides of biological information, including information on sequences and structures. We propose the use of two-dimensional data via 2DCNN to predict the proteins of enzymes and non-enzymes by using the same fivefold cross-validation function. We also use an independent dataset to test the performance of our model, and the results demonstrate that we are able to solve the overfitting problem. We used the CNN model proposed herein to demonstrate the superiority of our model for classifying an entire set of filters, such as 32, 64, and 128 parameters, with the fivefold validation test set as the independent classification. Via the Dipeptide Deviation from Expected Mean (DDE) matrix, mutation information is extracted from amino acid sequences and structural information with the distance and angle of amino acids is conveyed. The derived feature maps are then encoded in DDE exploitation. The independent datasets are then compared with other two methods, namely, GRU and XGBOOST. All analyses were conducted using 32, 64 and 128 filters on our proposed CNN method. The cross-validation datasets achieved an accuracy score of 0.8762%, whereas the accuracy of independent datasets was 0.7621%. Additional variables were derived on the basis of ROC AUC with fivefold cross-validation was achieved score is 0.95%. The performance of our model and that of other models in terms of sensitivity (0.9028%) and specificity (0.8497%) was compared. The overall accuracy of our model was 0.9133% compared with 0.8310% for the other model.Non-coding RNAs have remarkable roles in acute lung injury (ALI) initiation. Nevertheless, the significance of long non-coding RNAs (lncRNAs) in ALI is still unknown. Herein, we purposed to identify potential key genes in ALI and create a competitive endogenous RNA (ceRNA) modulatory network to uncover possible molecular mechanisms that affect lung injury. https://www.selleckchem.com/products/S31-201.html We generated a lipopolysaccharide-triggered ALI mouse model, whose lung tissue was subjected to RNA sequencing, and then we conducted bioinformatics analysis to select genes showing differential expression (DE) and to build a lncRNA-miRNA (microRNA)- mRNA (messenger RNA) modulatory network. Besides, GO along with KEGG assessments were conducted to identify major biological processes and pathways, respectively, involved in ALI. Then, RT-qPCR assay was employed to verify levels of major RNAs. A protein-protein interaction (PPI) network was created using the Search Tool for the Retrieval of Interacting Genes (STRING) database, and the hub genes were obtained wxpand our knowledge on the regulation mechanisms of lncRNA-related ceRNAs in the pathogenesis of ALI.Background Hepatocellular carcinoma (HCC) is one of the most common malignant tumors worldwide, and its prognosis remains unsatisfactory. The identification of new and effective markers is helpful for better predicting the prognosis of patients with HCC and for conducting individualized management. The oncogene Aurora kinase A (AURKA) is involved in a variety of tumors; however, its role in liver cancer is poorly understood. The aim of this study was to establish AURKA-related gene signatures for predicting the prognosis of patients with HCC. Methods We first analyzed the expression of AURKA in liver cancer and its prognostic significance in different data sets. Subsequently, we selected genes with prognostic value related to AURKA and constructed a gene signature based on them. The predictive ability of the gene signature was tested using the HCC cohort development and verification data sets. A nomogram was constructed by integrating the risk score and clinicopathological characteristics. Finally, the influence of the gene signature on the immune microenvironment in HCC was comprehensively analyzed. Results We found that AURKA was highly expressed in HCC, and it exhibited prognostic value. We selected eight AURKA-related genes with prognostic value through the protein-protein interaction network and successfully constructed a gene signature. The nine-gene signature could effectively stratify the risk of patients with HCC and demonstrated a good ability in predicting survival. The nomogram showed good discrimination and consistency of risk scores. In addition, the high-risk group showed a higher percentage of immune cell infiltration (i.e., macrophages, myeloid dendritic cells, neutrophils, and CD4+T cells). Moreover, the immune checkpoints SIGLEC15, TIGIT, CD274, HAVCR2, and PDCD1LG2 were also higher in the high-risk group versus the low-risk group. Conclusions This gene signature may be useful prognostic markers and therapeutic targets in patients with HCC.Genetic differences between individuals underlie susceptibility to many diseases. Genome-wide association studies (GWAS) have discovered many susceptibility genes but were often limited to cohorts of predominantly European ancestry. Genetic diversity between individuals due to different ancestries and evolutionary histories shows that this approach has limitations. In order to gain a better understanding of the associated genetic variation, we need a more global genomics approach including a greater diversity. Here, we introduce the Healthy Life in an Urban Setting (HELIUS) cohort. The HELIUS cohort consists of participants living in Amsterdam, with a level of diversity that reflects the Dutch colonial and recent migration past. The current study includes 10,283 participants with genetic data available from seven groups of inhabitants, namely, Dutch, African Surinamese, South-Asian Surinamese, Turkish, Moroccan, Ghanaian, and Javanese Surinamese. First, we describe the genetic variation and admixture within the HELIUS cohort. Second, we show the challenges during imputation when having a genetically diverse cohort. Third, we conduct a body mass index (BMI) and height GWAS where we investigate the effects of a joint analysis of the entire cohort and a meta-analysis approach for the different subgroups. Finally, we construct polygenic scores for BMI and height and compare their predictive power across the different ethnic groups. Overall, we give a comprehensive overview of a genetically diverse cohort from Amsterdam. Our study emphasizes the importance of a less biased and more realistic representation of urban populations for mapping genetic associations with complex traits and disease risk for all.The U-box gene encodes a ubiquitin ligase that contains a U-box domain. The plant U-box (PUB) protein plays an important role in the plant stress response; however, very few studies have investigated the role of these proteins in Moso bamboo (Phyllostachys edulis). Thus, more research on PUB proteins is necessary to understand the mechanisms of stress tolerance in P. edulis. In this study, we identified 121 members of the PUB family in P. edulis (PePUB), using bioinformatics based on the P. edulis V2 genome build. The U-box genes of P. edulis showed an uneven distribution among the chromosomes. Phylogenetic analysis of the U-box genes between P. edulis and Arabidopsis thaliana suggested that these genes can be classified into eight subgroups (Groups I-VIII) based on their structural and phylogenetic features. All U-box genes and the structure of their encoded proteins were identified in P. edulis. We further investigated the expression pattern of PePUB genes in different tissues, including the leaves, panicles, rhizomes, roots, and shoots. The qRT-PCR results showed that expression of three genes, PePUB15, PePUB92, and PePUB120, was upregulated at low temperatures compared to that at 25°C. The expression levels of two PePUBs, PePUB60 and PePUB120, were upregulated under drought stress. These results suggest that the PePUB genes play an important role in resistance to low temperatures and drought in P. edulis. This research provides new insight into the function, diversity, and characterization of PUB genes in P. edulis and provides a basis for understanding their biological roles and molecular mechanisms.