- Research
- Open access
- Published:
Genome mining, structural elucidation and surface-active property of a new lipopeptide from Bacillus subtilis
Microbial Cell Factories volume 24, Article number: 106 (2025)
Abstract
Background
The Bacillus genus is well known for producing structurally diverse lipopeptides, many of which exhibit remarkable surface-active and bioactive properties, such as surfactin and daptomycin. In recent years, genome mining has emerged as an effective tool for the discovery of novel natural products by predicting biosynthetic gene clusters and linking them to secondary metabolite production. However, the full biosynthetic potential of many Bacillus subtilis strains remains unexplored. Therefore, this study aimed to investigate the biosynthetic potential of an oilfield-isolated Bacillus subtilis strain through genome mining, with the goal of identifying novel lipopeptides with enhanced surface activity.
Results
In this study, we identified 14 biosynthetic gene clusters, four of which were related to lipopeptide biosynthesis. In addition, a lipopeptide was characterized as a new member of the surfactin family, namely surfactin-C18. The primary structure of surfactin-C18 was determined to be a heptapeptide ring of N-Glu-Leu-Leu-Val-Asp-Leu-Leu-C linked to the longest β-hydroxy fatty acid in the surfactin family, containing 18 carbon atoms. Moreover, we investigated the surface activity of surfactin-C18, measuring its critical micelle concentration and the surface tension to be 1.99 µmol/L and 28.63 mN/m, respectively. The obtained adsorption parameters of surfactin-C18 at the air/liquid interface further explained its enhanced surface activity in comparison with other surfactin homologs and commercial surfactants.
Conclusions
To the best of our knowledge, this is the first report on the structural characterization and surface activity of surfactin-C18. In addition, our findings not only demonstrate the biosynthetic potential of B. subtilis but also highlight the power of the genome mining strategy for discovering novel lipopeptides with industrial applications.
Background
Lipopeptides are promising microbial-derived biosurfactants produced by various genera, including Bacillus, Pseudomonas, Streptomyces, Paenibacillus, and Brevibacillus [1]. Microbial lipopeptides were first discovered in the 1950s, and to date, more than 237 lipopeptides from approximately 57 families have been identified and characterized in terms of their chemical structures and properties [2]. According to their structural characteristics, microbial lipopeptides are mainly divided into three representative families: surfactin, iturin and fengycin, which typically consist of a hydrophobic fatty acid chain linked to a hydrophilic peptide chain by lactone or amide bonds [3]. This unique amphiphilic structure enables lipopeptides to modify the interfacial properties, reducing surface tension, as well as exhibiting remarkable bioactivities [4, 5]. These lipopeptides are biosynthesized by nonribosomal peptide synthetases (NRPS), which facilitate the fatty acid chain and amino acid residues incorporating into the backbone [6]. Therefore, the structural complexity and functional diversity of microbial lipopeptides highlight their potential as bioactive natural products with ever-expanding applications [7,8,9].
With advancements in microbial genome sequencing and bioinformatics technologies, genome mining has been considered a powerful tool to uncover biosynthetic gene clusters (BGCs) capable of encoding novel natural products [10, 11]. In particular, within the genera Bacillus, researchers have successfully identified new lipopeptides with previously uncharacterized structures and enhanced bioactive features by integrating genome mining with mass spectrometry detection [12]. Although hundreds of Bacillus genomes have been completed sequenced and uploaded into public databases, the genome mining studies remain limited in evaluating their biosynthetic potential and predicting unknown compounds [13]. Consequently, the exploration of uncharacterized BGCs and the discovery of novel bioactive natural products through genome mining in representative Bacillus strains remain critical research priorities.
Surfactin, primarily produced by Bacillus genus, has been one of the most well-known cyclic lipopeptides due to the structural diversity and excellent activities since it was first discovered in 1968 [14]. Its structure has been extensively characterized, composing of a typical heptapeptide moiety (L-Glu1-L-Leu2-D-Leu3-L-Val4-L-Asp5-D-Leu6-L-leu7) linked to a β-hydroxyl fatty acid chain of 11–17 carbon atoms to form a lactone ring structure [15]. The structural diversity of the surfactin family has been widely documented, and numerous research groups continue to isolate and characterize novel compounds, further expanding the lipopeptide family [16, 17]. In addition to its excellent surface activity under extreme environmental conditions, surfactin displays high biodegradability, low toxicity, along with reduced environmental impact [18, 19]. The outstanding physicochemical properties of surfactin also contribute to its remarkable biological activities, including antibiofilm, antimicrobial, antiviral and antitumor effects [20, 21]. These advantages have sparked increasing biotechnological and pharmaceutical interest, leading to its potential applications as biosurfactant, food additive, biopesticide and drug [22]. Therefore, it is essential to discover new surfactin variants with better performance to further enhance their applicability.
In this study, we explored the metabolic potential of a Bacillus subtilis strain TD7 through the genome mining approach. The presence of 14 putative biosynthetic gene clusters guided the discovery of a new lipopeptide from the surfactin family, named surfactin-C18. The compound was isolated and structurally characterized by a series of analytical techniques, including ESI-MS, LC-MS, GC-MS and NMR. Meanwhile, the surface-active property of surfactin-C18 was first preliminarily evaluated and compared with known surfactin homologs. Overall, these results laid an experimental foundation for the genome mining-based discovery of novel microbial lipopeptides, and may significantly broaden their industrial applications in enhanced oil recovery and cosmetics.
Results and discussion
Genomic features and annotation
We collected environmental samples from the Daqing oilfield, from which the Bacillus subtilis TD7 strain was isolated. To investigate the genomic and metabolic features, the complete genome of B. subtilis TD7 strain was sequenced, yielding a fully assembled circular chromosome (Fig. S1) with a G + C content of 43.81% (Table S1). A total of 4236 protein-coding genes (CDS) with 11 sRNAs and 30 rRNA (5–16 S-23 S rRNA) were predicted with a total length of 3,613,914 bp after genomic component analysis, which accounts for 88.77% of the full length of the genome. Comprehensive gene prediction and functional gene annotation were performed using seven commonly used databases, and a total of 4200, 3849, 3517, 3236, 2125, 4173, and 191 unique genes were matched to sequences in the NR, Swiss-Prot, Pfam, COG, GO, KEGG and CAZyme databases, respectively.
Among the 26 functional classifications based on COG annotation analysis, the top five COG categories of B. subtilis TD7 were general function prediction only (R, 389 genes, 10.14%), amino acid transport and metabolism (E, 338 genes, 8.81%), carbohydrate transport and metabolism (G, 338 genes, 8.81%), transcription (K, 331 genes, 8.63%) and function unknown (S, 246 genes, 6.41%) (Fig. 1). In addition, 112 genes were successfully classified under secondary metabolites biosynthesis, transport and catabolism (S), indicating the metabolic potential of B. subtilis TD7. Moreover, Gene Ontology (GO) classification assigned the identified genes into 24 groups across three major functional categories: molecular function (ten groups), cellular component (five groups) and biological processes (nine groups) (Fig. S2). Among these, the genes related to biological processes were the most abundance, followed by those associated with molecular functions, while genes classified under cellular component was the least abundant. For molecular functions, the three functions with the most abundance were catalytic activity, binding and transporter activity. Among cellular components, the top classifications were membrane, membrane part and cell part. In the biological processes category, the most represented functions were metabolic process, cellular process and single-organism process. Overall, the above results indicat that many functions of B. subtilis TD7 are related to metabolism and membrane transport, providing robust evidence for its metabolic capabilities.
The Bacillus genus is well known as a promising source of secondary metabolites, and numerous bioactive compounds, such as lipopeptides, have been extensively characterized. To better understand the metabolic characteristics of B. subtilis TD7, we used the antiSMASH webserver to predict biosynthetic gene clusters (BGCs) in the genome and identify their corresponding secondary metabolites. The analysis demonstrated fourteen plausible BGCs, including four nonribosomal peptide synthetases (NRPSs), one T3PKS, one lanthipeptide-class-I, one CDPS, one sactipeptide, one RiPP-like, one epipeptide, and four unclassified BGCs (Fig. 2 and Table S2). The NRPS cluster 1, which shares 82% similarity with surfactin biosynthesis in the antiSMASH database, contains all domains required for surfactin production. Thus, it is believed that the B. subtilis TD7 strain possesses biosynthetic potential for surfactin production.
Schematic diagram of fourteen secondary biosynthesis gene clusters in B. subtilis TD7. The potential BGCs were predicted using antiSMASH webserver. Each number on the overview Fig. corresponds to one biosynthetic gene cluster. Genes with different functions are marked in different colour blocks, which are shown in the legend
It is commonly recognized that the rich repertoire of BGCs from the Bacillus genus, which encodes a diverse array of bioactive secondary metabolites, including polyketides, siderophores, terpenes, and nonribosomally lipopeptides and ribosomally synthesized peptides [23,24,25]. Among these, NRPS-dependent lipopeptides, such as surfactin, iturin, daptomycin, exhibit multifunctional properties as biosurfactants, antimicrobials, antibiotics and antitumor agents [26]. Given their structural diversity and functional versatility, the discovery of novel lipopeptides remains a key focus in natural product research. Therefore, using targeted genome mining tools to identify and predict BGCs responsible for the production of secondary metabolites has emerged as a powerful approach [27,28,29]. Here, we performed antiSMASH analysis of the B. subtilis TD7 genome and identified 14 putative BGCs, four of which were associated with lipopeptide biosynthesis: surfactin, fengycin, bacillaene and bacillibactin, respectively, revealing the high biosynthetic potential of the B. subtilis strain for producing structurally diverse lipopeptides with abundant activities.
Isolation and purification of crude lipopeptide
Based on its unique metabolite profiles, a series of pure lipopeptide compounds have been purified from the cell-free broth via acid precipitation, ethyl acetate extraction and RP-HPLC. Several peaks were observed in the separation and preparative chromatograms of the lipopeptide extract (Fig. S3). According to our previous reports, the fractions with the elution time ranging from 15 min to 75 min corresponded to the surfactin homologs, specifically surfactin-C13 to surfactin-C17 [30]. Therefore, we speculated that the fraction at 95.6 min represented an unknown surfactin variant, which was confirmed using the ninhydrin test and TLC analysis. An analytical RP-HPLC system was subsequently employed to collect and purify this fraction for downstream analyses. The results indicated that this fraction was a pure product, designated as M1. Overall, lipopeptide M1 was collected, concentrated and subjected to a series of structural characterization techniques, with a yield of approximately 4 mg/L of culture (0.74% of total surfactin content). This yield was significantly lower than that of other surfactin homologs, as determined by LC-MS analysis as described previously (Table 1) [31]. Considering the minor differences in the molecular configuration and composition among surfactin homologs, there is no consistent retention time pattern for surfactins with the same or different β-hydroxy fatty acid chain lengths, especially when analyzing mixtures of homologs with varying fatty acid chain lengths. To address this challenge, Zhao et al. developed a new GC-MS method for the accurate quantification of individual surfactin homologs based on their β-hydroxy fatty acid lengths [32]. After hydrolysis and methyl esterification, the extracted ion chromatogram at m/z = 103 was used to identify β-hydroxy fatty acids. Furthermore, differences in molecular configurations and lengths resulted in variation in retention time, even for β-OH fatty acid methyl esters of the same chain lengths. Using this method, Meng et al. successfully compared the composition and relative abundance of different surfactin homologs produced by wild-type and mutan B. subtilis strains [33].
Structural characterization
Lipopeptide M1 was obtained as a yellowish-brown solid. The molecular mass of lipopeptide M1 was determined by high-resolution electrospray ionization mass spectrometry (ESI-MS) analysis in the positive ionization mode (Fig. 3). The most abundant fragment ion was at m/z 1078.73 [M + H]+, which was preliminarily attributed to the surfactin family of lipopeptides. According to previously reported studies, the molecular weight of surfactin-C17 (m/z 1063.06) was 14 Da less than that of lipopeptide M1, corresponding to an additional methylene group. These observed main peaks (m/z values) also corresponded to various surfactin homologs containing different types of amino acids or amino acid modifications at different locations on the peptide ring [34]. Therefore, further analysis of the chemical structure of the fatty acid chain and peptide ring is required.
The detailed information on the chemical structure of lipopeptide M1 was based on ESI-MS (Fig. 3A). A traditional method used to determine the sequence of amino acids in lipopeptide is Edman degradation, which relies on analyzing separated fragments obtained after partial hydrolysis of the cyclic lipopeptide [35]. However, Yang et al. established a TOF MS/MS method to directly determine the relationship of amino acid residues without hydrolysis [36]. According to the double hydrogen transfer (DHT) mechanism, a type of the McLafferty rearrangements of alkoxy group of aliphatic esters and amides, this process was considered a diagnostic tool in mass spectrometry to confirm the presence of carbonyl-containing functional group [37]. Thus, taking the ion peak m/z at 1078.73 as a precursor, the observed loss of 18 Da from 1078.73 to 1060.62 was assigned to a ring-opening reaction of an ester bond in the cyclic lipopeptide. Additionally, based on the simple cleavage of the hydrogen-ionized molecular ions, several fragment ions corresponded to the loss of amino acid residues (Fig. 3B). The fragment ions at m/z 1060.62, 965.57, 737.55, 525.39, 412.31 and 283.20 from lipopeptide M1 suggested the following amino acid sequence: N-Glu-Leu-Leu-Val-Asp-Leu-Leu-C (Table 2). The amino acid sequence of lipopeptide M1 was further verified after trimethylsilylation via gas chromatography-mass spectrometer (GC-MS) analysis. Compared with the standard amino acid sample, the result indicated the presence of leucine (Leu), aspartic acid (Asp), glutamic acid (Glu) and valine (Val) (Fig. S4). The peak area of each identified amino acid was normalized to determine the molar ratio of Leu: Asp: Glu: Val as 4: 1: 1: 1. The proposed amino acid sequence of the lipopeptide M1, N-Glu-Leu-Leu-Val-Asp-Leu-Leu-C, was consistent with the structure of a typical surfactin [38].
After hydrolyzed into free fatty acid and trimethylsilylation derivatization, the trimethylsilylation method was used to evaluate the hydrophobic part of lipopeptide M1 with GC-MS (Fig. 4). The BSTFA (N, O-bis(trimethylsilyl)-trifluoroacetamide) reacted with the fatty acid part of the original surfactin sample, which consisted of β-hydroxyl fatty acid replacing active hydrogens with -Si(CH3)3. The characteristic peak of m/z = 233 suggested that there was a structure of -[CHO(Si(CH3)3)CH2COOSi(CH3)3]+, replacing one hydrogen and one methyl on the -[CH(OH)CH2COOCH3]+ (m/z = 103) with - Si(CH3)3 group, which is the characteristic fragmentation ion of a β-hydroxy fatty acid. Since the peak of m/z = 429 corresponded to the molecular mass of the analyzed fatty acid part, the presence of this peak confirmed the structure of the fatty acid chain with 18 carbon atoms. 13C NMR spectroscopy is a direct way applied to determine the structures of fatty acid variations in lipopeptide (iso, anteiso and normal). In addition to NMR, the structure of the fatty acid chains in cyclic lipopeptide can be easily determined via MS spectrogram according to the ratio of the relative intensity of I43/I57 [39]. The relatively larger, smaller and middle values correspond to iso-, anteiso- and n-, respectively. Therefore, the structure of the fatty acid chain in lipopeptide M1 was considered a linear structure.
With the above data, we performed a detailed NMR analysis to further confirm the lipopeptide structure of the molecule. Inspection of the 1H NMR spectra revealed signals corresponding to seven amide groups (-NH-) protons at δH 7.35–8.08 ppm, while the α-amino acid protons of the amino acids at δH 4.47–5.10 ppm (Fig. S5). A doublet at δH 0.83–0.97 ppm indicated a long alkyl chain of the lipopeptide M1 [-(CH3)2-CH-]. Additional multiples in the up-field region arose from the side-chain protons of the amino acids, and the remaining spectra supported the presence of β-hydroxy fatty acid. The 13C-NMR spectrum showed strong signals δC at 11.42–17.82, 18.41–58.45, and 170.82-173.89 ppm from methyl, methylene, and carboxyl groups, respectively (Fig. S6), which was consistent with the mass spectrometry results.
To sum up, the lipopeptide produced by B. subtilis TD7 could be preliminarily concluded as a new surfactin member, the surfactin-C18, which was confirmed by a combination of ESI-MS, LC-MS, GC-MS, 1H NMR and 13C NMR analyses. The primary structure of the surfactin-C18 incorporates a common peptide loop of seven amino acids (N-Glu-Leu-Leu-Val-Asp-Leu-Leu-C) containing 18 carbons in the linear fatty acid chain (Fig. 5).
Surface activity of the surfactin-C18
A critical property of biosurfactants is their ability to form micelles and reduce the surface tension or interfacial tension at air/aqueous and oil/aqueous interfaces [40]. As shown in Fig. 6A, with the increasing concentration of surfactin-C18 solution, surface tension at various concentrations can be readily decreased to below 28 mN/m, with the highest tested concentration being 1 mmol/L. According to the breakpoint of the plot of surface tension (γ) versus logic, the critical micelle concentration (CMC) value of surfactin-C18 was 1.99 µmol/L at 25 °C, with a corresponding surface tension at CMC (γCMC) of 28.63 mN/m. The adsorption isotherm properties of surfactin-C18 solution were determined using the γ-lgc fitting curve, including the surface excess concentration (Γ), the area occupied by the molecule (A), and the free energy of micellization (ΔG) (Table S3). The calculation showed that the maximum surface excess concentration (Гmax) and the corresponding minimum area occupied by one surfactin-C18 molecule (Amin) at the air/liquid interface were estimated to be 1.17 µmol/m2 and 141.93 Å2, respectively, indicating a compact molecular arrangement of surfactin-C18 at the air/aqueous interface.
Biosurfactants are distinguished by their amphiphilic structure with a tendency to adsorption at various interfaces, such as air/water and oil/water interfaces[41]. A fundamental parameter for evaluating the performance of biosurfactants is the CMC value, showing the ability to reduce the surface tension at the air/water interface [42]. As biosurfactant concentration increases, surface tension continues to decrease until reaching a concentration termed the CMC value, beyond which the first micelle forms, leading to significant variation in the physicochemical properties of the solution, while above the CMC, the surface tension is extensively independent of concentration [43, 44]. Generally, the biosurfactants with lower CMC values tend to reduce surface tension more effectively at lower concentrations, exhibiting better surface/interface properties in wetting, dispersing, foam-forming, and emulsification [45]. Several methods can be used to determine the CMC value such as surface tension, electrical conductivity and UV-absorption spectroscopy measurements, and in this study, we employed the widely used approach based on the γ-lgc isotherm [46]. Notably, surfactin-C18 exhibited superior surface activity compared to other surfactin homologs that were previously reported [30]. For comparison, the CMC values of surfactin-C11, C12, C13, C14, C15, C16 and surfactin-C17 are 55.9, 55.6, 26.0, 7.53, 5.04, 4.04 and 4.02 µmol/L, respectively, with corresponding minimum surface tensions of 34.42, 32.70, 27.09, 27.35, 28.52, 28.82 and 28.98 mN/m, respectively (Fig. 7). These values highlight that surfactin-C18 ranks among the most effective surfactins in reducing surface tension. Importantly, when compared to some commercial synthetic surfactants, such as non-ionic surfactant Triton-X 165 (Table S3), surfactin-C18 also exhibited higher efficiency in lowering the surface tension, reinforcing its potential as a high-performance biosurfactant. Additionally, surfactin exhibits the notable ability to maintain low surface tension under various environmental conditions in the pH range from 5 to 13, temperature range from 4 ℃ to 121 ℃ and salinities up to 9%, along with a wide range of bioactive properties, including antimicrobial, antiviral, antibiofilm and hemolysis [47,48,49]. Consequently, surfactin holds promise as a therapeutic agent, a biosurfactant and a bioremediation agent in potential industrial and therapeutic applications.
The adsorption properties of biosurfactants are mainly related to the chemical structure of the hydrophobic and hydrophilic moiety, including the length of the hydrophobic tail, the type of the peptide moiety, and the overall molecular size [41]. In the case of the structural characteristics, all the above surfactins have the same hydrophilic heads but differ in the length of their hydrophobic tails. Consequently, variations in CMC values can be possibly explained by the difference in the structure and length of the fatty acid chains: the longer hydrophobic chain results in improved surface-active performance. From an isothermal perspective, the biosurfactant concentration in the monolayer at the air/water interface was determined according to the γ-lgc isotherm and Gibbs adsorption isotherm. As illustrated in Table S3, the adsorption amount of surfactin on the air/aqueous interface increased with the increase in the hydrophobic chain length. The surface area of the monolayer expanded from 0.43 µmol/m2 to 1.17 µmol/m2, while the free energy decreased from − 24.27 kJ/mol to -32.52 kJ/mol. A possible explanation for this trend is that a longer hydrophobic chain length facilitates interaction at the air/aqueous interface and alters the relative position of the peptide ring, resulting in a reduction of the area occupied by a single molecule. These results clearly indicate that the surface activity of the surfactins enhanced with increasing fatty chain length, which is more conducive to micelle formation. Additionally, the micelle shape was evaluated based on the critical packing parameter (CPP) values, and as expected, the longer hydrophobic chain had an insignificant influence on the micelle structure, which remained spherical micelles (CPP < 1/3). Moreover, previous studies have demonstrated that as the length of the hydrophobic chain in surfactin increased, the diameter of aggregates significantly increased with the formation of larger aggregates [50]. Specifically, as the fatty acid chains of surfactins extend from 12 carbon atoms to 16 carbon atoms, β-sheet interaction between the micelles is reinforced, making surfactin more likely to form large aggregates. Despite the promising surface/interface-active properties of surfactin, the scalability of surfactin production remains the major limitation of the industrial application. We previously increased the surfactin yield produced by B. subtilis TD7 from 0.62 g/L to 2.14 g/L by promoter substitution [51]. In the future, our group will conduct in-depth research on the issue of yield improvement through optimizing the fermentation process, genomic engineering and synthetic biology strategies. Separately, using a microfluidic diffusion platform and molecular dynamics simulation to further clarify the molecular mechanisms at various interfaces. Addressing these challenges would expand the applications of surfactin in bioremediation, microbial enhanced oil recovery, food and pharmaceutical fields.
Conclusions
In summary, this work highlighted the significant biosynthetic potential of the Bacillus subtilis TD7 strain for lipopeptide production through genome analysis and mining. AntiSMASH has been utilized to discover that four of fourteen BGCs are related to lipopeptide biosynthesis, including those for surfactin, fengycin, bacillaene and bacillibactin. In addition, we reported the isolation and structural characterization of a new member of the surfactin family, surfactin-C18, produced by B. subtilis TD7. Surfactin-C18 compound was successfully purified using organic extraction followed by preparative HPLC. Its structural features were elucidated using a combination of analytical methods, including ESI-MS, LC-MS, GC-MS and NMR, confirming a heptapeptide ring linked to the longest fatty acid moiety reported to date. Moreover, the surface activity test demonstrated surfactin-C18 exhibited a CMC value of 1.99 µmol/L and a corresponding γCMC of 28.63 mN/m, Compared with other surfactin homologs and commercial surfactants, surfactin-C18 exhibits superior surface-active property. This study reinforces the excellent potential of microorganisms as a valuable resource for novel natural product discovery and emphasizes the importance of continued exploration of Bacillus species with a biosynthetic potential. Furthermore, surfactin-C18 presents a promising alternative to traditional synthetic surfactants and other surfactin homologs, with potential applications in microbial enhanced oil recovery, food and cosmetics industry.
Methods
Strain and culture conditions
The Bacillus subtilis TD7 strain, used in the current study, was obtained from the Daqing oilfield, China [52]. The strain was stored in a 50% (v/v) LB-glycerol solution at -80 ℃ in our laboratory. Initially, a frozen glycerol culture stock was streaked onto a LB agar plate (10 g/L NaCl, 10 g/L peptone, 5 g/L yeast and 2.5 g/L agar powder) for 24 h, and the colony was obtained. Then a single colony was transferred to 100 mL LB liquid medium, and incubated at 37℃ and 180 rpm for 20 h, as the seed culture.
Fermentation conditions
The 2% (v/v) seed culture was inoculated in 250 mL Erlenmeyer flasks containing 100 mL optimized cultivation medium on rotary shakers at 37 ℃, 200 rpm for 72 h, including 20.000 g/L sucrose, 2.450 g/L KH2PO4, 3.970 g/L Na2HPO4, 1.340 g/L NH4Cl, 2.125 g/L NaNO3, 1.000 g/L yeast, 0.100 g/L MgSO4, 0.070 mmol/L CaCl2 and 0.100 mmol/L MnCl2. For the above medium, pH was adjusted to 7.0-7.5 using 1 mol/L NaOH before autoclaved sterilization. After 72 h cultivation, bacteria were pelleted by centrifugation (Sigma Centrifuge 3K15, Rotor No. 12150) at 8000 rpm for 20 min and the cell-free supernatant was obtained.
Extraction and purification
Briefly, the lipopeptide extract was obtained by a combination of acid precipitation and ethyl acetate extraction methods. 6 mol/L HCl was titrated to adjust the pH value of the supernatant to pH < 2.0, and the suspension was left at 4 ℃ overnight. The suspension was centrifuged at 8000 rpm for 10 min to get the acid precipitation, by washing three times with deionized water. Subsequently, the precipitation was lyophilized until dry and then extracted with ethyl acetate three times to obtain lipopeptide extracts. The crude extracts were suspended in 100% methanol and injected into a reverse-phase high-performance liquid chromatography (RP-HPLC) (JASCO, Japan) after filtering through a 0.22 μm filter membrane. Mobile phase A was 100% methanol and mobile phase B was ultra-pure water with 0.05% (v/v) TFA. The proportion of the mobile phases was controlled via the following gradient program: 0–90 min, 90% A and 90–100 min, 100% A. The elution fraction of peaks was collected manually and freeze-dried to perform structural analysis.
Genome sequencing and assembly
Genomic DNA of Bacillus subtilis TD7 from a pure culture using the PowerSoil DNA Isolation Kit (MO BIO, USA) and the genome sequencing was performed using Illumina Hiseq 2000 sequencer according to the standard protocol, including DNA extraction, DNA sample quality testing, library construction, library purification, library detection, library quantification, generation of sequencing clusters, and up-sequencing. After obtaining the whole genome sequence, the bioinformatic work was mainly performed based on three databases: COG (Clusters of Orthologous Groups, http://www.ncbi.nlm.nih.gov/COG/), GO (Gene Ontology, http://www.geneontology.org) and KEGG (Kyoto Encyclopedia of Genes and Genomes, http://www.genome.jp/kegg/). BGCs responsible for the biosynthesis of secondary metabolites were identified using the antiSMASH webserver [53].
Structural elucidation
Generally, the eluted fraction was obtained by RP-HPLC, and a detailed structural characterization was performed by a diverse range of analytical methods as previously described [17]. The molecular mass of the lipopeptide extract was determined by high-resolution electrospray ionization mass spectrometry (ESI-MS) (Water, America) at the Analysis and Test Centre, East China University of Science and Technology. The m/z values were measured from 100 to 2000. Additionally, the lipopeptide extract was hydrolyzed with 1 mL 6 mol/L HCl at 90 ℃ for 20 h in a stoppered test tube, followed by the removal of excess solvent with air blow at 60 ℃. The fatty acid residue was labeled with 0.5 mL of acetonitrile-BSTFA (N, O-bis (trimethylsilyl)-trifluoroacetamide, 3:2, v/v), and heated at 60 ℃ for 20 min. The labeled sample was injected into GC-MS prior to adding 1 mL acetonitrile to dissolve. Moreover, the analysis of the amino acids was also carried out by GC-MS after acid hydrolysis and trimethylsilylation. The GC-MS analysis was performed on a 6890 GC system (Agilent, USA) coupled with a 5975 MSD (Agilent, USA) and equipped with an HP-5MS capillary column (30 m × 0.25 mm × 0.25 μm). The sample after silylation was injected and analyzed with EI operating at 70 eV and used a source temperature of 230 °C. The column oven temperature was held initially at 60 °C for 3 min, then increased to 250 °C at a rate of 10 °C/min, and held at 250 °C for 5 min. Other operating conditions were as follows: helium carrier gas, 99.999%; flow rate, 1.0 mL/min; injector temperature, 250 °C; injector volume, 2.0 µL; and split ratio, 20:1. All NMR spectra were recorded at 298 K on a Bruker Avance 600 MHz with 1.0 mL CDCl3 as a solvent.
Surface property assay
Different amounts of lipopeptide were completely dissolved in water to form a series of solutions with concentrations ranging from 0.01 to 1% (w/v). The surface tension of these solutions was measured at 25 °C using a DCAT 21 tensiometer (Dataphysics, Germany). The critical micelle concentration (CMC) value and the surface tension at CMC (γCMC) were calculated according to the plot of surface tension versus lipopeptide concentration [54]. The derivative of the equilibrium surface tension curve (dγ/dlgc) at the CMC can be used to estimate surface properties, including the maximum surface excess concentration (Гmax) and the minimum area occupied by a surfactin molecule (Amin) at the air/liquid interface [55]. The calculation of the above parameters was carried out using the following equations:
Where γ is the surface tension, R is the universal gas constant (8.314 J mol− 1 K− 1), T is the absolute temperature, and c is the surfactant concentration, and N is the Avogadro number (6.022 × 1023).
Statistical analysis
All experiments were conducted in at least three independent biological replicates, with each sample measured in triplicate for each assay. Data are presented as mean ± standard deviation (SD) using OriginPro 2023. The presented images are representative of multiple independent experiments that yielded consistent results.
Data availability
No datasets were generated or analysed during the current study.
Abbreviations
- BGC:
-
Biosynthetic gene clusters
- CDS:
-
Protein-coding genes
- NRPSs:
-
Nonribosomal peptide synthetases
- COG:
-
Clusters of Orthologous Groups
- GO:
-
Gene Ontology
- KEGG:
-
Kyoto Encyclopedia of Genes and Genomes
- ESI-MS:
-
Electrospray ionization mass spectrometry
- RP-HPLC:
-
Reverse-phase high-performance liquid chromatography
- LC-MS:
-
Liquid chromatography-mass spectrometry
- DHT:
-
The double hydrogen transfer
- TLC:
-
Thin-layer chromatography
- GC-MS:
-
Gas chromatography-mass spectrometer
- Leu:
-
Leucine
- Asp:
-
Aspartic acid
- Glu:
-
Glutamic acid
- Val:
-
Valine
- NMR:
-
Nuclear magnetic resonance
- CMC:
-
Critical micelle concentration
- CPP:
-
Critical packing parameter
- MEOR:
-
Microbial enhanced oil recovery
References
Sreedharan SM, Rishi N, Singh R. Microbial lipopeptides: properties, mechanics and engineering for novel lipopeptides. Microbiol Res. 2023;271:127363.
Sani A, Qin W-Q, Li J-Y, Liu Y-F, Zhou L, Yang S-Z, et al. Structural diversity and applications of lipopeptide biosurfactants as biocontrol agents against phytopathogens: A review. Microbiol Res. 2024;278:127518.
Chen MC, Liu TT, Wang JP, Chen YP, Chen QX, Zhu YJ, et al. Strong inhibitory activities and action modes of lipopeptides on lipase. J Enzyme Inhib Med Chem. 2020;35(1):897–905.
Zhu ZW, Zhang BY, Cai QH, Cao YQ, Ling JJ, Lee K, et al. A critical review on the environmental application of lipopeptide micelles. Bioresour Technol. 2021;339:125602.
Qiao J, Borriss R, Sun K, Zhang R, Chen X, Liu Y, et al. Research advances in the identification of regulatory mechanisms of surfactin production by Bacillus: a review. Microb Cell Fact. 2024;23(1):100.
Chen H, Zhong L, Zhou H, Bai X, Sun T, Wang X, et al. Biosynthesis and engineering of the nonribosomal peptides with a C-terminal Putrescine. Nat Commun. 2023;14(1):6619.
Li Y, Liu L, Song S, Kuang H, Xu C. A rapid and semi-quantitative gold nanoparticles based strip sensor for polymyxin B sulfate residues. Nanomaterials (Basel). 2018;8(3).
Dey R, De K, Mukherjee R, Ghosh S, Haldar J. Small antibacterial molecules highly active against drug-resistant Staphylococcus aureus. MedChemComm. 2019;10(11):1907–15.
Markande AR, Patel D, Varjani S. A review on biosurfactants: properties, applications and current developments. Bioresour Technol. 2021;330:124963.
Zhou L, de Jong A, Yi Y, Kuipers OP. Identification, isolation, and characterization of medipeptins, antimicrobial peptides from Pseudomonas mediterranea EDOX. Front Microbiol. 2021;12:732771.
Zheng W, Wang X, Zhou H, Zhang Y, Li A, Bian X. Establishment of recombineering genome editing system in Paraburkholderia megapolitana empowers activation of silent biosynthetic gene clusters. Microb Biotechnol. 2020;13(2):397–405.
Aleti G, Sessitsch A, Brader G. Genome mining: prediction of lipopeptides and polyketides from Bacillus and related Firmicutes. Comput Struct Biotechnol J. 2015;13:192–203.
Vater J, Herfort S, Doellinger J, Weydmann M, Borriss R, Lasch P. Genome mining of the lipopeptide biosynthesis of paenibacillus polymyxa e681 in combination with mass spectrometry: discovery of the lipoheptapeptide paenilipoheptin. ChemBioChem. 2018;19(7):744–53.
Arima K, Kakinuma A, Tamura G. Surfactin, a crystalline peptidelipid surfactant produced by Bacillus subtilis: isolation, characterization and its Inhibition of fibrin clot formation. Biochem Biophys Res Commun. 1968;31(3):488–94.
Moro GV, Almeida RTR, Napp AP, Porto C, Pilau EJ, Lüdtke DS, et al. Identification and ultra-high-performance liquid chromatography coupled with high-resolution mass spectrometry characterization of biosurfactants, including a new surfactin, isolated from oil-contaminated environments. Microb Biotechnol. 2018;11(4):759–69.
Romano A, Vitullo D, Di Pietro A, Lima G, Lanzotti V. Antifungal lipopeptides from Bacillus amyloliquefaciens strain BO7. J Nat Prod. 2011;74(2):145–51.
Liu X, Yang S, Mu B. Production and characterization of a C15-surfactin-O-methyl ester by a lipopeptide producing strain Bacillus subtilis HSO121. Process Biochem. 2009;44(10):1144–51.
Olasanmi IO, Thring RW. Evaluating rhamnolipid-enhanced washing as a first step in remediation of drill cuttings and petroleum-contaminated soils. J Adv Res. 2020;21:79–90.
Yang L, Chen C, Liang T, Hao L, Gu Q, Xu H, et al. Disassembling ability of lipopeptide promotes the antibacterial activity. J Colloid Interface Sci. 2023;649:535–46.
Li Y, Héloir MC, Zhang X, Geissler M, Trouvelot S, Jacquens L, et al. Surfactin and Fengycin contribute to the protection of a Bacillus subtilis strain against grape downy mildew by both direct effect and defence stimulation. Mol Plant Pathol. 2019;20(8):1037–50.
Fei D, Liu FF, Gang HZ, Liu JF, Yang SZ, Ye RQ, et al. A new member of the surfactin family produced by Bacillus subtilis with low toxicity on erythrocyte. Process Biochem. 2020;94:164–71.
Kaspar F, Neubauer P, Gimpel M. Bioactive secondary metabolites from bacillus subtilis: a comprehensive review. J Nat Prod. 2019;82(7):2038–53.
Singh SS, Sharma D, Baindara P, Choksket S, Harshvardhan, Mandal SM, et al. Characterization and antimicrobial studies of iturin-like and bogorol-like lipopeptides from Brevibacillus spp. Strains gi9 and skdu10. Front Microbiol. 2021;12:729026.
Xia L, Miao Y, Cao Al, Liu Y, Liu Z, Sun X, et al. Biosynthetic gene cluster profiling predicts the positive association between antagonism and phylogeny in Bacillus. Nat Commun. 2022;13(1):1023.
Kiesewalter HT, Lozano-Andrade CN, Wibowo M, Strube ML, Maróti G, Snyder D et al. Genomic and chemical diversity of Bacillus subtilis secondary metabolites against plant pathogenic Fungi. mSystems. 2021;6(1).
Zhang SY, Chen YL, Zhu J, Lu QJ, Cryle MJ, Zhang YM, et al. Structural diversity, biosynthesis, and biological functions of lipopeptides from Streptomyces. Nat Prod Rep. 2023;40(3):557–94.
Skinnider MA, Merwin NJ, Johnston CW, Magarvey NA. PRISM 3: expanded prediction of natural product chemical structures from microbial genomes. Nucleic Acids Res. 2017;45(W1):W49–54.
Medema MH, Blin K, Cimermancic P, de Jager V, Zakrzewski P, Fischbach MA, et al. AntiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 2011;39(suppl2):W339–46.
van Heel AJ, de Jong A, Song C, Viel JH, Kok J, Kuipers OP. BAGEL4: a user-friendly web server to thoroughly mine RiPPs and bacteriocins. Nucleic Acids Res. 2018;46(W1):W278–81.
Qin WQ, Fei D, Zhou L, Guo YJ, An S, Gong OH, et al. A new surfactin-C17 produced by Bacillus subtilis TD7 with a low critical micelle concentration and high biological activity. New J Chem. 2023;47(16):7604–12.
Qin WQ, Liu YF, Liu JF, Zhou L, Yang SZ, Gu JD, et al. The metabolic and physiological responses to spaceflight of a lipopeptide-producing Bacillus subtilis. Microb Biotechnol. 2025;18(3):e70111.
Zhao Y, Yang SZ, Mu BZ. Quantitative analyses of the isoforms of surfactin produced by Bacillus subtilis HSO 121 using GC-MS. Anal Sci. 2012;28(8):789–93.
Meng Y, Zhao W, You J, Gang H-Z, Liu J-F, Yang S-Z, et al. Structural analysis of the lipopeptide produced by the Bacillus subtilis mutant R2-104 with mutagenesis. Appl Biochem Biotechnol. 2016;179(6):973–85.
Yu FT, Shen YY, Qin YL, Pang YY, Fan HL, Peng JJ, et al. Isolation and purification of antibacterial lipopeptides from Bacillus velezensis YA215 isolated from sea mangroves. Front Nutr. 2022;9:1064764.
Peypoux F, Besson F, Michel G, Delcambe L. Structure of bacillomycin D, a new antibiotic of the Iturin group. Eur J Biochem. 1981;118(2):323–27.
Yang SZ, Wei DZ, Mu BZ. Determination of the amino acid sequence in a Cyclic lipopeptide using MS with DHT mechanism. J Biochem Biophys Methods. 2006;68(1):69–74.
Liu X, Yang S, Mu B. Isolation and characterization of a C12-lipopeptide produced by Bacillus subtilis HSO 121. J Pept Sci. 2008;14(7):864–75.
Ciurko D, Czyżnikowska Ż, Kancelista A, Łaba W, Janek T. Sustainable production of biosurfactant from agro-industrial oil wastes by Bacillus subtilis and its potential application as antioxidant and Ace inhibitor. Int J Mol Sci. 2022;23(18):10824.
Yang S, Wei D, Mu B. Determination of the structure of the fatty acid chain in a Cyclic lipopeptide using GC–MS. J Biochem Biophys Methods. 2007;70(3):519–23.
Kalvandi S, Garousin H, Pourbabaee AA, Alikhani HA. Formulation of a culture medium to optimize the production of lipopeptide biosurfactant by a new isolate of Bacillus Sp.: a soil heavy metal mitigation approach. Front Microbiol. 2022;13:785985.
Rekiel E, Zdziennicka A, Szymczyk K, Jańczuk B. Thermodynamic analysis of the adsorption and micellization activity of the mixtures of rhamnolipid and surfactin with triton X-165. Molecules. 2022;27(11):3600.
Chen C, Zhu M, Li M, Fan Y, Sun RC. Epoxidation and etherification of alkaline lignin to prepare water-soluble derivatives and its performance in improvement of enzymatic hydrolysis efficiency. Biotechnol Biofuels Bioprod. 2016;9:87.
Marchant R, Banat IM. Microbial biosurfactants: challenges and opportunities for future exploitation. Trends Biotechnol. 2012;30(11):558–65.
Yekeen N, Padmanabhan E, Idris AK, Ibad SM. Surfactant adsorption behaviors onto shale from Malaysian formations: influence of silicon dioxide nanoparticles, surfactant type, temperature, salinity and shale lithology. J Petrol Sci Eng. 2019;179:841–54.
Zhao F, Shi R, Ma F, Han S, Zhang Y. Oxygen effects on rhamnolipids production by Pseudomonas aeruginosa. Microb Cell Fact. 2018;17(1):39.
Hutin A. Application Notes -Method: 4. Measurement of Critical Micelle Concentration (CMC).2021.
Su Y, Gao L, Li C, Wang L, Zhou H, Zhang C, et al. Regulation mechanism and bioactivity characteristic of surfactin homologues with C14 and C15 fatty acid chains. Microb Cell Fact. 2024;23(1):94.
Abdelraof M, Nooman MU, Hashem AH, Al-Kashef AS. Production and optimization of surfactin produced from locally isolated Bacillus halotolerans grown on agro-industrial wastes and its antimicrobial efficiency. BMC Microbiol. 2024;24(1):193.
Phulpoto IA, Yu Z, Hu B, Wang Y, Ndayisenga F, Li J, et al. Production and characterization of surfactin-like biosurfactant produced by novel strain Bacillus nealsonii S2MT and it’s potential for oil contaminated soil remediation. Microb Cell Fact. 2020;19(1):145.
Baccile N, Seyrig C, Poirier A, Alonso-de Castro S, Roelants SLKW, Abel S. Self-assembly, interfacial properties, interactions with macromolecules and molecular modelling and simulation of microbial bio-based amphiphiles (biosurfactants). A tutorial review. Green Chem. 2021;23(11):3842–944.
Mu BZ, Liu FF, Liu YF, Qiao YW, Guo YZ, Kuang FY, et al. Improvement surfactin production by substitution of promoters in Bacillus subtilis TD7. Appl Environ Biotechnol. 2021;6(1):11.
Liu JF, Yang J, Yang SZ, Ye RQ, Mu BZ. Effects of different amino acids in culture media on surfactin variants produced by Bacillus subtilis TD7. Appl Biochem Biotechnol. 2012;166(8):2091–100.
Blin K, Shaw S, Kloosterman AM, Charlop-Powers Z, van Wezel GP, Medema Marnix H, et al. AntiSMASH 6.0: improving cluster detection and comparison capabilities. Nucleic Acids Res. 2021;49(W1):W29–35.
Perinelli DR, Petrelli D, Vitali LA, Bonacucina G, Cespi M, Vllasaliu D, et al. Quaternary ammonium leucine-based surfactants: the effect of a benzyl group on physicochemical properties and antimicrobial activity. Pharmaceutics. 2019;11(6):287.
Lucarini S, Fagioli L, Cavanagh R, Liang W, Perinelli DR, Campana M, et al. Synthesis, structure-activity relationships and in vitro toxicity profile of lactose-based fatty acid monoesters as possible drug permeability enhancers. Pharmaceutics. 2018;10(3):81.
Acknowledgements
The authors would like to thank The Research Centre of Analysis and Test of East China University of Science and Technology for the help on ESI-MS and NMR analyses.
Funding
This work was supported by the National Key Research and Development Program (2022YFC2105200), the Fundamental Research Funds for the Central Universities of China (22221818014), and the Research Program of the State Key Laboratory of Bioreactor Engineering.
Author information
Authors and Affiliations
Contributions
W.Q.Q designed and performed the experiments, analyzed all results and wrote the manuscript. Y.F.L wrote and revised the manuscript. L.Z. and J.F.L. provided bioresource. D.F. and K.H.X. contributed to the structural characterization assay. S.Z.Y. and J.D.G. revised the manuscript. B.Z.M. conceptualized and supervised the research. All authors reviewed and approved the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Qin, WQ., Liu, YF., Zhou, L. et al. Genome mining, structural elucidation and surface-active property of a new lipopeptide from Bacillus subtilis. Microb Cell Fact 24, 106 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12934-025-02723-y
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12934-025-02723-y