Detection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server

doi:10.21203/rs.3.rs-3973092/v1

Detection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server

2024 · doi:10.21203/rs.3.rs-3973092/v1

preprint OA: closed

Full text JSON View at publisher

Full text 141,379 characters · extracted from preprint-html · click to expand

Detection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Method Article Detection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server Alexandre Mezghrani, Juliette SIMON, Victor Reys, Gilles Labesse This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-3973092/v1 This work is licensed under a CC BY 4.0 License Status: Posted Version 1 posted You are reading this latest preprint version Abstract Interactomics is bringing a deluge of data regarding protein-protein interactions (PPIs) which are involved in various molecular processes in all types of cells. However, this information does not easily translate into direct and precise molecular interfaces. This limits our understanding of each interaction network and prevents their efficient modulation. A lot of the detected interactions involve recognition of short linear motifs (SLiMs) by a folded domain while others rely on domain-domain interactions. Functional SLiMs hide among a lot of spurious ones, making deeper analysis of interactomes tedious. Hence, actual contacts and direct interactions are difficult to identify. Consequently, there is a need for user-friendly bioinformatic tools, enabling rapid molecular and structural analysis of SLiM-based PPIs in a protein network. In this chapter, we describe the use of the new webserver SLiMAn to help digging into SLiM-based PPIs in an interactive fashion. Structural Biology Bioinformatics Interactomes proteome annotations comparative modeling protein sequence Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 8 Figure 9 1. Introduction In all organisms, from virus to mammals, proteins, throughout their lifespan, interact with a large number of partners, especially other proteins. The number of protein-protein interactions (PPIs) can be reasonably estimated to be in the order of a million in each eukaryotic cell [1]. At the basis of life plasticity, the vast majority of an interactome is mainly constituted of transient PPIs. For example, for the 20% of proteins that are secreted, even before they reach their proper location, they have transiently interacted with dozens of different partners along the secretion pathway, post-translationally shaping their molecular and structural characteristics [2]. Similarly, cellular protein degradation systems involve sophisticated machinery in which targeted proteins interact transiently with multiple proteins [3]. Many signaling cascades rely also on such weak PPIs [4]. As these interactions can be transient, with low affinity and often spanning a few percent of the total of gene products, they have been largely underestimated in cell biology and hard to characterize. During the last two decades, substantial technical progress has been made in interactomics with high-throughput Affinity Purification Mass Spectrometry (AP-MS) as well as with large scale yeast two-hybrid system allowing the identification of around half-a-million of interactions for the ~20000 human proteins [5–7]. Whereas classical AP-MS is well suited to identify high-affinity interactions involved in stable core protein complexes, the detection of true transient protein-protein interactions is more challenging even in presence of cross-linkers [8]. Another big challenge in AP-MS is discrimination between direct or indirect interactions in a proteome. In contrast, two-hybrid systems focus on binary interactions, and appear more suitable for transient protein interaction detection with the disadvantage of being reductionist and not perfectly physiological for non-yeast proteins [9]. Consequently, this method has a high rate of false positive and negative hits although careful and redundant strategy can help improving these statistics [10]. Low-throughput studies in the test tube, like complex reconstitution, are time consuming, biased, and often non-physiological but highly informative at the molecular and structural levels. New methodologies emerged to unravel undiscovered PPI such as dedicated phage-display systems [11], systematic directed mutagenesis using Crispr-Cas9 technology [12] and massive cross-linking (diazirine photo-crosslinking and MS) [13]. Despite the different limits mentioned above, systematic studies with each technique for a defined macromolecule pave the way to a complete human interactome resolution. Combining the various outputs to gain better insights is useful but can be complex despite remarkable early attempts [14, 15]. Numerous molecular and structural studies have highlighted that many PPIs involve a folded domain recognizing short linear motifs (SLiMs) [16, 17]. Several hundred of SLiMs are playing key roles in all cellular processes including cell signaling, metabolism and as well as protein trafficking [18]. SLiMs are also at the basis of all protein post-translational modifications (PTMs) like proteolysis, phosphorylation, ubiquitination, glycosylation and others, as protein-modifying enzymes like protein kinases and proteases usually recognize only a short segment of consecutive amino-acids [19]. SLiMs are often found in flexible loop regions for single globular proteins and in intrinsically disordered segments for multidomain proteins [20]. Not surprisingly, more than 20 of the 340 ELM SLiMs registered in the ELM database are localized at the N- or C-terminal protein parts (http://elm.eu.org/). Different resource databases gather interactomic data [21]. Most of them, like STRING [22], CM2D3 [23], BioGRID [24] and IntAct [25] retrieve biochemical and genetic data for a given gene product to show connection to putative partners – or interactants – and some systems score their reliability [22]. However, in most of them, the precise molecular and structural features of each PPI are not directly accessible. Furthermore, despite the high amount of information collected, various connections are missed even in STRING, as previously noted [26]. Tools dedicated to domain-domain interactions based on comparative modeling have been described [27] but there are limits to the number of complexes one can model. However, the newly developed artificial-intelligence (AI)-based approaches, such as AlphaFold multimer, provide a promising alternative to standard homology-based modeling [28]. Furthermore, the detection and modeling of SLiM-based interactions is hampered by the elusive nature of SLiMs. The SLiMAn webserver has been developed to integrate interactomics data in order to identify SLiMs and their recognition patterns as well as to perform comparative modeling (when possible) [26]. Here, we highlight enhanced features recently added in SLiMAn2 with examples run in analytical and discovery modes. 2. Material A primary version of SLiMAn has been already described [ 26 ] . Since then, the source code has been completely rewritten, new features were added and the web server rendering now fully exploit JavaScript for an enhanced interactivity and user experience. New tools or databases have been added in the version 2.0 of SLiMAn (available at: https://sliman2.cbs.cnrs.fr/ and described in more detail elsewhere (manuscript in preparation))(Figure 1). Among them, the IntAct database that brings additional experimental pairing while AlphaFold predictions [ 29 ] (through the huge database of systematically pre-computed structural models) increase confidence in the boundaries of folded and disordered regions in each protein (Figure 1). In addition, fast connection to PubMed accelerate screening of published material for a given pairing or protein (http://www.ncbi.nlm.nih.gov/pubmed). Other additional features are also described along their use in this chapter (Figure 1). 3. Methods 3.1 Query types. SLiMAn can be interrogated in two ways depending on the data available. Either a given interactome has been obtained (or selected from any source of data) and the corresponding list of proteins can be directly submitted to the webserver or the submitted entry corresponds to only one unique protein. In the latter case, SLiMAn interrogates BioGRID and/or IntAct to retrieve a (meta-)interactome. This constitutes a list of putative interactants to be submitted directly and analyzed using the very same webserver. Once the query list is submitted, the same workflow applies to the data regardless of their origin (Figure 1). In the case an original interactome is submitted, the data from BioGRID and/or IntAct can serve as a validation, as illustrated previously [26] and further in this protocol. To illustrate the different properties of SLiMAn2 for interactomic data analysis, a focused proteomic study on tankyrase 1 and 2 published by Li and colleagues in 2017, name hereafter the TNKS1/2 interactome is proposed [30]. The information extracted from this analysis is compared to the one resulting from performing a parallel analysis of the meta-interactome extracted from BioGRID and IntAct for the same two tankyrases. The goal is to define the molecular features of the different PPIs present in the interactomes from distinct sources and to reveal potential SLiM-based interactions involved in different functional networks. We describe below the step-by-step method used to partially decipher a SLiM-based protein network. 3.1.1. Querying a novel interactome 1. Collect identifiers (or accession codes) from Uniprot for all the partners of the TNKS1/2 interactome. 2. Start a new project on the webserver by submitting the list of names – separated by commas – on, the front page (Figure 2A). A project name can be input in the dedicated window. 3. Press the button “Find Interactions” just below and SLiMAn will start its analysis (see below 3.2). 3.1.2. Quick start from a given protein. Alternatively: Submit the Uniprot names for TNKS1 or 2 (TNKS1_HUMAN or TNKS2_HUMAN, respectively) to the menu “PPI Extension” on the SLiMAn front page (Figure 2B). In this case, SLiMAn queries BioGRID and IntAct that contain curated protein interactomic data for most model organism species. Press “Find Interactants” and SLiMAn-2 will retrieve all the interactomic data from these two databases (see 3.1.3). 3.1.3 Filtering putative interactants From a given UniProt protein name (or code), SLiMAn rapidly extracts the putative partners listed in BioGRID and/or IntAct. A resulting webpage interactively enlists, by default, the proteins associated with the query in any of those two databases. This list can range from very few to up to a thousand of protein partners. SLiMAn can manage more than one hundred of interactors but not a thousand in the current version. However, it is likely too difficult to survey too many partners at once. Hence, the number of interactants to study, can be filtered using the «Parameters» section shown just above the protein list. Data from only one database can be used instead or, on the contrary, one can focus on the intersection of the two databases, BioGRID and IntAct, for higher confidence. Note that there is a significant overlap between the two databases and in that case, this redundancy cannot be seen as a cross-validation. Otherwise, as the data from BioGRID are separated in low- and high-throughput classes while IntAct data are split in general data and those from HuRI (http://www.interactome-atlas.org/; [10]), one can reduce further and tune the query list before submission. For example, only low-throughput data from the BioGRID database can be selected or, instead, only HuRI proteomic data (direct PPIs) within IntAct data. For higher confidence, filtering in interactants detected by both low- and high-throughput methods and present in both databases, is a good choice, although at the expense of the number of partners. Each time, SLiMAn updates the list of proteins to survey. One can hoover the mouse on the query name to read the number of interactants enlisted. Once parameters are chosen, press the button “Quick launch” to launch SLiMAn analysis. 3.2 Main outputs. Whatever the type of data (provided list of protein or meta-data extracted for one given protein), SLiMAn will search for ELM motifs as well as the corresponding PFam domains (see Note 1) within the submitted protein sequences. To filter only relevant information, orphan or unpaired ELM motifs and PFam domains are discarded. This dramatically reduces the list of ELMs amenable to analysis, in contrast to a direct interrogation of the ELM database. The motifs and domains filtered in, are shown on the main table for further interactive analysis. This table highlights all the putative pairing between an ELM motif and the corresponding domain. Numerous parameters can be used to filter in or out, more or less protein pairs as illustrated below. 3.2.1. General view. In its upper part, the main result page recapitulates the query parameters (query name, number of partners, …). After an automatic setting of the filters (see Note 2), it tabulates the number of ELM motifs, PFam domains and number of proteins containing (or not) such motifs and/or domains (Figure 3). It also provides links to useful outputs for subsequent studies. In the central part of the webpage, eight modules of different filters are present with the corresponding buttons, cursors or digits for selection (Figure 4). The “ELM”, “Disorder”, “HSM” and “PSP+” filters allow to manage structural and molecular parameters of PPis (Figure 4A). The second part of the filter panel is dedicated to the selection (“PPI database”, “SLiMan” (level of confidence),”visual”) and display forms (“visual”) of PPis (Figure 4B). Several examples of the roles of this interface in interactive analysis of an interactome is devellop in more details below (see 3.3). In the lower part, a table is provided in which the ELM-PFam pairing computed by SLiMAn are highlighted. In the top row, PFam domains are listed associated with the UniProt identifier of the proteins containing them. In the left column, SLiMs corresponding to ELM entries, are listed in association with the UniProt identifier of the proteins containing them. Molecular features for each SLiM are detailed, namely, its ELM class, the corresponding regular expression and sequence motif as well as its location in the studied protein. If a PTM is annotated in the motif, as extracted from PhosphoSitePlus (herein PSP+; [31]), it is also highlighted (see below for more details). For each ELM-PFam pair highlighted by SLiMAn, a box is drawn and colored according to the corresponding confidence level. It contains three links and a check box. The latter can be used to select a pairing and collect the corresponding validated pairs for subsequent visualization in a table or a Cytoscape network (see below) [32]. The “Hit” link (upper right panel) opens up a pop-up window recapitulating all the parameters computed for the match (see Figure 5 and following sections for further details). The lower left box is a link to an alignment module and the possible launch of comparative modeling of the putative complex that the ELM motif and the matched domain could form (see 3.4). When actual modeling within the framework of SLiMAn is performed, a new link is created in the lower right box of the main result page. These additional steps may help assessing the likelihood of a pairing. Finally, below each column in the main table, a consensus sequence is computed and highlighted using LogoJS [33] for each ELM motif type paired with a given PFam domain. A selection button enables switching from one ELM type to another (e.g.: LIG_SH3_1 to LIG_SH3_9). Modifying thresholds for the various parameters triggers a new computation resulting in a new table of pairings and new logos. 3.2.2. Specific information for each ELM-PFam pair. In each ELM/PFam pair box within the main table, pressing the “Hit” button grants direct access to information regarding a given putative interaction in a dedicated pop-up window (Figure 4). This window details the ELM entry (motif name and E-value, sequence boundaries and whether or not it is part of the validated instance according to ELM), the experimental data extracted from BioGRID and IntAct databases (indicating the number of hits and providing links to the associated publications in PubMed), key biophysical parameters from IUPred2A [34] and AlphaFold [35] as well as the pairing likelihood score computed by HSM [36]. The IUPred2A section highlights the different scores (ANCHOR, IUPred local and global disorder or domain scores) for protein disorder predictions [34]. SLiMAn2 also provides the pLDDT score from pre-computed models stored in the AlphaFold database (AFDB) [29]. This score estimates the local accuracy, and was observed to nicely correlate to local disorder computed by IUPred. HSM is a dedicated predictor of interaction likelihood (from 0 to 1) for 6 types of recognition domains (PDZ, SH2, SH3, WW, WH1 and PTB) [36]. The number of experiments corresponding to a given pair of proteins within BioGRID and IntAct are split in categories (e.g.: Low vs High throughput methods for BioGRID data, as discuss above). Select “toggle details” to see the precise type of experiments used, the date of deposit in the database (for IntAct data), a reference in PubMed as well as a link to the associated publication (as extracted from BioGRID and IntAct information). Additional technical information from IntAct and HuRI are also displayed (see Note 3). In the upper-right corner of the pop-up window, “PubMed queries” links are provided to search PubMed for publications describing each protein as well as those associating the two proteins of that pair (Figure 6). The latter corresponds to a search in PubMed combining all the alternative identifiers, accession and gene names for each protein in the pair, using logical operators. It enables quick access to related research articles found in literature and can compensate the lack of deposition in PPI repositories. 3.3 Hands on SLiMAn interactivity The main page of results shows a subset of putative pairings extracted from the query list with a set of parameters tuned automatically (Figure 5). This works well with most queries, highlighting some ELM motifs but hiding many others due to current parameters such as, a too stringent (= small) E-value, or restrictive disorder parameters. The interactive fine-tuning of parameters supports the identification of promising pairings and the detection of direct interactions through a given motif and a corresponding domain. Additional analysis can be also initiated from this webpage such as sequence alignment and comparative modeling of motif-domain complexes. This part is described in more detail at the end of this chapter as it may require some expertise in structural biology to be easily handled and truly fruitful. But most of the analyses using SLiMAn rely on the frontpage. 3.3.1 Parameter and filter description We now briefly define the different filters available, while their specific use is illustrated in the next section. Known PPIs can be selected from BioGRID or IntAct similarly to the precedent step (3.1.3). A heuristic scale (1-8) of confidence, combining several criteria for a SLiM-based PPI (predicted biophysical properties, experimental evidences...), was set to allow the user to quickly select PPIs with the highest level of confidence. Several types of filters can be applied : “ELM” and “PSP+” filters are defined respectivley to: Set the upper bound threshold for the ELM motif E-value, set ELM class of SLiMs (cleavage, modification, targeting, degradation, docking, ligand) or the ELM validated instances. Filter putative pairings based on the presence/absence of a given posttranslational modifications (PTM) and its requirement for the motif to be functional. As several SLiMs contain one or more PTMs, corresponding experimental information within each motif is made available through a link to the PhosphoSitePlus database (https://www.phosphosite.org/homeAction ). Values of structural parameters (IUpred2A, AlphaFold) or pair likeliness (HSM) can be set and tuned to refine displayed SLiMAn predictions of PPis. Text filters are applicable to limit the analysis to a given type of motif, domain or protein. To that end, the corresponding ELM or PFAM expression or the protein name can be input in the visual toolbox (e.g.: SH3_1 in the ELM box or SH3 in the PFam box). 3.3.2 Consulting validated instances of SLiM-based PPIs The use of these different filters for interactomic data analyses is illustrated with a step-by-step and hierarchical approach which gradually defines the molecular features of the different PPIs found in a published interactome of the two human tankyrases (130 proteins) [30], members of Poly(ADP-Ribose) Polymerase proteins (PARPs) family, and their respective meta-interactomes from BioGRID-IntAct (170 and 80 interactants for TNKS1 and 2). The final results for those three searches in SliMAn can be found on the webserver (https://sliman2.cbs.cnrs.fr/study/TNKS1-2-Inter.html ; https://sliman2.cbs.cnrs.fr/study/TNKS1-Meta.html ; https://sliman2.cbs.cnrs.fr/study/TNKS2-Meta.html ). This approach aims at revealing potential SLIM-based interactions from the most likely to the less convincing ones while building a hierarchical molecular network. Step 1: Identification of the most likely PPIs Request “Display all” within the Visualization panel. Select the ELM valid instances without applying any filter (i.e.: select “Validated Instances” within the ELM panel and switch off other parameters). For the combined interactome TNKS1/2 [30], SLiMAn2 shows that 4 proteins would interact directly with tankyrase 1. These ELM valid instances involve a SLiM motif named Tankyrase Binding Motif (hereafter TBM) found in AXIN1, FNBP1, TERF1 and CASC3 and recognized by several ankyrin repeats (ARC or ANK) of TNKS1 and TNKS2 (Figure 5A) [37]. We can observe that these TBM motifs are predicted to be in unstructured regions (pLDDT 0.4). At this stage of analysis, no other type of SLiM-based PPIs appears to involve TNKS1 or TNKS2 while two other PPIs are revealed as ELM instances between GSK3b-AXIN1 and GSK3b-TP53. Here, two distinct linear motifs in AXIN1, its TBM (21-28) and its GSK3b docking site (383-389) would bridge GSK3b and TP53 to the tankyrases (see Note 4). Note that within the BioGRID/IntAct meta-interactomes, 4 ELM instances (TBM detected in AXIN1, FNBP1, TB182 and TERF1) are found for TNKS1 and none for TNKS2. Furthermore, it shows only a partial overlap with the TNKS1/2 interactome with 3 common interactants (out of 5 in total). Step 2: Detection of highly-confident SLiM-based PPIs Switch off the ELM instances and set the level of confidence to 8 with a low E-value (0.005). This increases the number of proteins displayed from 6 to 19 among 128 preys for TBNKS1/2. Fourteen proteins harbor ELM motifs putatively recognized by 7 PFam domains, which include 4 protein-kinase catalytic domains and 2 FHA domains on top of the Ankyrin repeats from the two tankyrases. Here, the ELM validated instance for CASC3 with TNKS1 is filtered out as no experimental evidence is recorded in BioGRID (requested Bio total > 0) or IntAct (requested IntAct+HuRI > 0) for that pair. Switch on the disordered parameters (Anchor > 0.4; Short and long Disorder > 0.4; pLDDT score < 60) to filter out a few motifs (15 out 202) and keep only most likely ones. At this level of confidence and filtering, two additional substrates of human tankyrases appear (BABA1 and GO45) and they are connected to no other proteins. AXIN1 is still connected to tankyrases as well as to GSK3s and therefore TP53. The table highlights three other sub-networks, one corresponding to a multimer of the protein-kinase Chk2 (through its FHA motif and domain), one connecting MRE11 and nibrin (again through an FHA pair) and a last one connecting the protein-kinase STK26 to STRN4, STRP1, and CT2NL, due to various phosphorylation motifs and multiple experimental evidences from BioGRID and IntAct. At a similar level of confidence and filtering, 10 PPIs involving 11 proteins are highlighted for the meta-interactome of TNKS1 whereas no additional PPIs was obtained for TNKS2. Interestingly, 5 proteins (AXIN1, FNBP1, GO45, TERF1 and BABA1) supposedly interacting with TNKS1 are common to the BioGRID/IntAct meta-interactome and the TNKS1/2 interactome. As illustrated with this example, SLiMAn facilitates the identification of both direct and indirect connections or possible ternary complexes. At such a stringent filtering, interactions or pairings predicted by SLiMAn merely match already well-known interactions. However, lowering the stringency, may result in too many pairings for simultaneous inspection. 3. Use text-based filtering to focus on one given type of pairings: Input PFam query: TNKS Input ELM query: DOC_ANK_TNKS_1 to display only the pairings involving Tankyrases and the DOC_ANK_TNKS_1 motif This selection leads to smaller table with one partner for TNKS2 and 6 for TNKS1 within the TNKS1/2 interactome. A similar trend is observed in the TNKS1 and TNKS2 meta-interactomes (with 6 interactants for TNKS1 and one for TNKS2). 4. Select Lower the confidence level to 6: three more TNSK2 partners (BCR, 3BP2 and TERF1) appear and only one (3BP2) for TNKS1 within the TNKS1/2 interactome, while up to 13 partners are found in the meta-interactome of TNKS1 and 8 for TNKS2. Among the 8 tankyrase binders within TNKS1/2 interactome, 6 are found also in the meta-interactomes of TNKS1 or TNKS2, and 3 of them (TERF1, 3BP2, BABA1) are shared by the three. This is still representing a tiny portion of all the preys listed in the various studies using human tankyrases as baits. This suggests that more pairings to the tankyrases may have to be characterized (or not) through SLiMAn interface by navigating at much lower stringency. Filter for predicted disordered using the above threshold remove only one validated partner (PAGE4) in TNKS1 meta-interactome [38], which can be brought back by increasing the AlphaFold pLDDT threshold to 65 (instead of 60). This pre-filtering analysis indicates the disorder parameters to select TBM from various Tankyrase partners. Step 3: Using biophysical filters to predict additional binders at low levels of confidence. As low confidence level can correspond to low disorder predictions and/or too few experimental evidences, one might want to counterbalance the low overall stringency by using parameters adapted to the particular pairings under scrutiny. As the TBM SLiM in ELM (DOC_TNKS_1) corresponds to highly flexible sequences, one can use rather stringent biophysical and structural features. Hence, Set IUPred2A and AlphaFold filters with high values (Anchor > 0.4; Short Disorder > 0.4; pLDDT score < 65). These values were derived from those observed for the TBM detected at high confidence level (first 8 and then 6; see above). This should allow us to dig into the (meta-)interactomes in a discovery mode and to spot more tankyrase partners actually bound through a TBM. Decrease the level of confidence to 4 (from 6). This reveals 21 putative interactors among the 128 preys (16%) of the TNKS1/2 interactome and 37 (out of 170, 22%) in the case of the TNKS1 meta-interactome and 24 proteins (out of 80, 30%) for TNKS2. Of note, 11 of those binders are potential new PPIs, whereas 10 are shared between TNKS1/2 and the two meta-interactomes and only 6 are found in the three interactomes. Decrease the level of confidence to 2 (from 4). Elven additional potential partners show up for the TNKS1/2 interactome bringing the total number of potential partners to 32. The low confidence scores (2) for most of those additional pairs (8/11), come from the lack of supporting experimental data within BioGRID and/or IntAct database, as the thresholds for disorder are stringent (and yield a confidence score of 2 by themselves). 14 binders are found in TNKS1/2 and the meta-interactomes, whereas 18 are new PPIs, and 7 are common to the three interactomes. It should be noted that a relatively small overlap is also observed between the two BioGRID/IntAct meta-interactomes with only 20 common proteins among the 62 potential TBM-dependant tankyrase binders. At first, these additional partners are questionable, as most of these new preys were obtained by only one independent experiment. Accordingly, they could need additional validations to ensure they indeed correspond to direct binding to one of the two tankyrases. Here, SLiMAn allows to point out which protein within the whole interactome, and which region in these proteins to prioritize in order to confirm this pairing. Step 4 : Adding alternative motif sequences in SLiMAn The relatively small number of tankyrase preys detected as direct binders, so far, indicates that other interactions are possibly still missed even at low stringency. Such failures could be due to domain-domain interactions (that cannot be shown explicitly by SLiMAn), to indirect interactions (see above and below), to the presence of divergent TBM or direct interactions mediated by other type of motifs and associated domains. The latter case is very likely as TNKS1 harbors several proline-rich SH3 binding motifs. Beside FNBP1 which possesses a SH3 domain but is also harboring a TBM, 3 nexins found in the TNKS1/2 interactome, do possess functional SH3 domains, while we detected no other connections to tankyrases otherwise. TNKS1 also harbors a PP2B docking motif and an FHA recognition motif. However, the latter is not phosphorylated according to PSP+ and, therefore, may be considered as not functional (see below). But one cannot exclude that the ELM motif is defined with a too stringent sequence signature. In fact, alternative motifs have been described in several substrates of tankyrases [39, 40]. Different from the stringent canonical TBM signature (DOC_ANK_TNKS_1: .R..[PGAV][DEIP]G.), the closely related (.R...[PGAV].G.) corresponds to a second motif with one additional residue within the same interacting partners [40]. Accordingly, search for potential alternative motifs that could fit into the TBM binding groove. Survey the crystal structures of tankyrase bound to various peptides Dig into the literature about tankyrase interactions. Structural studies corroborated by directed mutagenesis and affinity measurements, point to the importance of an acidic residue in +2 position of the strictly conserved glycine [37]. These alternative TBMs may possess the new signatures R.{2,3}[PGAVSCT].G.[DE] or R.{3,4}[NDQEIVPT]G.[DE]. Use the “Create your own RegEx” option to manually add a new signature to the initial query step in order to screen for additional Tankyrase substrates. Add the three patterns named respectively: Alt1 (.R...[PGAV].G.), Alt2 (R.{2,3}[PGAVSCT].G.[DE]) and Alt3 (R.{3,4}[NDQEIVPT]G.[DE]). In the TNKS1/2 interactome, the addition of alternative patterns combined with the canonical ELM-Ankyrin signature increases the number of potential Tankyrase interactors from 32 to 46 (for Alt-1), to 41 (Alt-2) and 41 (Alt-3), respectively. Compare, in this particular case, the enrichment levels for ADP-ribosylated proteins to evaluate each signature (i.e.: presence of the protein in the ADPriboDB 2.0 database [41] ). The proportion of ADP-ribosylated proteins increases from 48 % without filtering (complete interactome) to 65 % (ELM motif), 63% (Alt2) and 84% (Alt3) for each single filtering but for Alt1 (with only 43% of modified proteins). The best enrichment level (82 %) is obtained when combining the alternative sequence motifs Alt2 and Alt3 to the ELM canonical signature, which filter in 45 tankyrases substrates. Among these additional tankyrase partners, several were validated by low-throughput experiments (e.g.: 3BP2, 3BP5, RNF146). It also identified alternative TBM such as the second functional TBM in Pex14. These results suggest considering alternative motifs for tankyrase recognition. Step 5: Selection by ELM classes of SLIMs To focus or hierarchize the search for other motifs, SLiMAn2 also offers the possibility to analyze PPIs for each ELM class type with variable E-values. This mode is quite convenient as it reduces the size of the table of ELM-PFam pairing. The rational for filtering by ELM class type is also based to the different intrinsic properties of the PPIs. Indeed, SLiMs leading to the most stable PPIs (e.g.: SH3) are mainly presented in “Docking” (DOC) and “Ligand”(LIG) class types whereas more transient SLiM-based PPIs are found in “Modification” (MOD), “Cleavage”(CLV) and “Targeting” (TRG). In addition, different SliMs have distinct tendencies for disorder and for folding upon binding. 1. Lower the confidence level from 8 to 2 to search for other likely direct PPis of Tankyrases in the TNKS1/2 interactome. Similar disorder parameters than for the TBM-Ankyrin PPI were used but other filtering can be also set for each ELM-PFam pairs. For some DOC ELM classes (PDZ, SH3, SH2), SLiMAn integrates HSM biophysical prediction, enhancing filtering options [36]. 2. Filter by name with “SH3” the two well-known interaction motifs of TNKS1. In fact, the high-confidence interaction of FNBP1 with TNKS1 does not involve a TBM but a SH3 polyproline motif. Other SH3-based PPIs have lower levels of confidence (5) corresponding to three syntaxins (SNX9, 18 and 33). 3. Use HSM filters to rank the multiple pairing through SH3 motifs. Precisely, 14 proteins are found to potentially recognize 13 SLiMs in TNKS1, from three different classes (LIG, DOC and MOD) and localized in three N-terminal highly disordered regions (1-10; 24-83 and 145-166). By filtering for LIG and DOC ELM class types, it remains 10 potential interactors with FHA (KIF1a, KIF1b, NBN, SLMAP, CHK2) and the already mentioned SH3 (FNBP1, SNX9, SNX18 and SNX33) as well as Metallophosphoesterase (MRE11) domains. From them, KIF1b, MRE11 are already directly connected via TBM motifs to the tankyrases. Of note, apart from FNBP1, none of these potential TNKS1/2 binders are present in TNKS1 or TNKS2 meta-interactomes. However, a favorable SH3 based PPI is also predicted in the TNKS1 meta-interactome between TNKS1 and UBS3B. For TNKS2, similar parameters reveal no direct SLiM-based PPI in the TNKS1/2 interactome as well as in the TNKS2 meta-interactome in agreement with the lack of disordered N-terminal part compared to TNKS1. After similar step-by-step selections for the other SLiM class types (TRG, DEG, CLV), 17 new PPIs, composed of 4 direct SH3 PPIs with TNKS1 and 13 indirect (1 LIG, 2 DEG, 10 TRG) complete the TNKS1/2 interactome. Overall, 10 new direct or indirect SLIM-based partners have been added to the network on top of 42 proteins. Step 6: PTMs and recognition of MOD class SLiMs SLiMs and PTMs are tightly interconnected, although some protein modifications may occur due to chemical reactants with little site specificity (but for the modified residue) such as sulphur oxidation. By essence, most PTM sites should be associated with a SLiM, although not all have been precisely defined already [19]. Only a subset has been written in the "Modification" (MOD) class in the ELM database. These SLiMs are recognized by enzymes most likely through a transient interaction leading to the modification of one residue. Because of the transient nature of these interactions, we may not expect to detect them with most techniques dedicated to interactomics studies. Nevertheless, these modifications are often of uttermost importance for the functioning of macromolecules and need to be identified. Therefore, other validation schemes are required. SLiMAn highlights the residue that should be modified for a given MOD motif. It also highlights any residue if a PTM has been annotated in the PSP+ database (for a small set of model organisms including mainly human and two rodents). A color code and a filtering scheme were set in the new version of SLiMAn (PTM observed or not) to ease the selection of the most favorable MOD SLiMs (Table 1). Unfortunately, while ELM precisely defines the enzyme involved in those modifications (e.g.: MOD_CK2_1), the associated PFam domain comprises a large set of related proteins (e.g.: PF00069 and PF07714 for the majority of protein-kinases). Accordingly, SLiMAn is misled and frequently connects a motif with various enzymes for the same functional class ignoring the actual specificity, as illustrated below. This pairing should be cautiously considered when listed in a SLiMAn output. Filter for MOD by setting disorder: Anchor > 0.4; Short and long Disorder > 0.4; pLDDT score 0,01) and correspond to a high frequency motif sequences. Filter with PSP+ to select motifs for which the critical PTM has been experimentally detected (Table 1). The number of predicted pairings is 5735 among which 244 are supported by experimentally observed PPIs in BioGRID and/or IntAct database. These 244 PPIs involve 5 protein-kinases (TAOK2, CHK2, STK26 and GSK3Aa and GSK3B)b and 21 substrates (containing multiple motifs). In comparison, for the TNKS1 meta-interactome, similar filtering leads to 746 PPIs at confidence level 2 supported by 6 enzymes (STK11, STK36, TINIK, TITIN, PTEN and M4P4) and 28 substrates. For the TNKS2 meta-interactome, 3 kinases (PTEN, STK11 and MK01) and 8 substrates are potentially involved in 102 PPis. Whereas AXIN1, a well-known substrate of tankyrases, is present in the three interactomes, GSK3 kinases are however surprisingly absent, that is probably due to the higher prevalence of direct PPIs in the meta-interactomes, at least for the tankyrases. This observation is also supported by a very low number of indirect PPIs (only 2 for TNKS1) that have been found for the two tankyrases meta-interactomes. Use PSP+ information for additional filtering. Each motif should be scrutinized by navigating between SLiMAn and PSP+ database, which is easily accessible for each annotated motif PTM. Using PSP+ indications, two GSK3 phosphorylation sites on MCL1, already link to the tankyrases, can be validated. Indeed, a transient protein complex might bring together MCL1, tankyrases, AXIN1 and GSK3 kinases. Similarly, TP53 phosphorylation by CHK2 (T18) appear also highly likely. Another illustration for the usefulness of these tools is the highly sophisticated scenario that links GSK3b to AXIN1, that can be anticipated with SLiMAn2. The prior link of GSK3b involved a previously selected docking SLIM-based PPI =. Furthermore, AXIN1 phosphorylation by GSK3b is itself phospho-dependent as the motif must be primed. Despite its low e-value (0,026), the corresponding motif can be validated as information from PSP+ confirms that the motif is phosphorylated at the two required positions (S75 and T79). Overall, 10 MOD PPIs, involving 4 kinases and 5 substrates, have been selected in the TNKS1/2 interactome. Step 7: Using PSP+ to filter other PTM-dependant PPis. Several proteins (Kif1a, Kif1b, CHK2, NBN and SLMAP) in the TNKS1/2 interactome contain an FHA domain which recognizes a phosphorylated threonine in the LIG_FHA motif. As the e-value of the ELM FHA motif is quite high (>0,005), SLiMAn predicts a high number (3505) of putative FHA_1 based PPIs that can be further select : Switch off all ELM classes but the LIG one. Select motifs harboring a modified residue with PSP+ (Table1). The number of SLiMs drops to 265 comprising 155 mono-modified and 110 multi-modified motifs. Apply BioGRID/IntAct databases filters The number of potential FHA_1 motifs involved in known PPIs decreases to 24 (14 mono and 10 multi-modified motifs). Use PSP+ information for additional filtering After a survey of these particular PTM-modified SLIMs, a total of 18 proteins, not interacting by a TBM, can be linked to tankyrases for the TNKS1/2 interactome. Interestingly, 61 % (11/18) of these additional partners are ADP-ribosylated suggesting that they do belong to the TNKS1/2 interactome. Step 8: PTMs modulating SLiMs-based PPIs Beside the MOD class (see step 6), all the other ELM classes may contain motifs involving modified residues. The modification can be mandatory for the recognition (designated as primary/mandatory) such as the phosphorylation of a tyrosine for a SH2 motif or a threonine for a FHA motif (see step 7). Alternatively, it may not be required (designated as secondary/accessory), although it may still interfere with any binding event. Some secondary modifications have been shown to be important functional switches, as mandatory ones are, but secondary PTM can be neutral, favorable or unfavorable to binding [42]. Accordingly, it is important to discriminate these two types of modifications depending on the motif under scrutiny. Hence, SLiMAn indicates, like for the MOD SLiMs, the PTM required for a given ELM motifs but also those detected in the PSP+ database. The color code and a filtering scheme differentiates the various situations (PTM observed or not; required or not), in support of more accurate searching for important motifs requiring PTMs or harboring secondary switches. Switch off all ELM classes but the LIG one. Select “no or accessory PTM” (u U o ). Adjust disorder and confidence thresholds if necessary. Here, strict disorder is set on so that confidence can be very low (1). This selection highlights the presence of a phosphorylation site (S432) in the TRF1 binding motif (LIG_TRFH1) of NBN, although this modification is not required. Experimental data listed in PSP+ indicate that this modification is rather frequent and deleterious for the interaction between NBN and TRF2, a paralogue of TRF1. This illustrates another example of the utility of PSP+ selection tools for PTM analysis. 3.3.3 Interactome network viewing using Cytoscape Once a selection is achieved, it can be visualized in a dedicated Cytoscape window displaying the corresponding network. The progression of the analysis is illustrated in Figure 7. By default, all the proteins harboring a validated pairing (ex.: TNKS-FNBP1) are connected. Each protein is shown as a purple rectangle that contains a green hexagon representing a PFam domain. A set of different colors characterizes each type of link between two protein partners: ELM motif/PFam domain pair, BioGRID or IntAct connections as well as HSM scoring. This potentially emphasizes dense sub-networks that usually correspond to macromolecular assemblies (based often also on domain-domain interactions) and/or singletons (not shown here) that may require further inspection. The latter may belong to the studied interactome - through unknown interaction - or correspond to spurious preys. Proteins can be rearranged within this window to better show these networking features. This may provide clues to resume searching for ELM/PFam pairing by focusing on particular proteins. This analysis is complementary to the analysis summarised in the main table. For example, in Figure 7 several protein complexes, like STRIPAK (striatin-interacting phosphatase and kinase), MRN (MRE11, RAD50, NBN) that were initially disconnected or only lightly connected to the Tankyrases are linked (red arrows) at the end of the analysis. The addition of alternative TBMs appear to directly link to MRE11 as well as the MRN complex to tankyrase. Furthermore, apparently indirect binders like MCL1 and CHK2 are also directly linked to Tankyrases. 3.4 Structural model prediction of a given PPI Finally, SLiMAn can check for the presence of related complexes in the PDB. It requires a folded domain matching the referenced PFam in the structure as well as a peptide (less than 35 residues) matching the desired ELM motif, in order to be considered a potential template for comparative modeling. For each class in ELM, an extraction from the PDB led to suitable templates for almost half of the possible ELM/PFam pairs, for a total of 5325 extracted templates. SLiMAn gives first access to an interactive webpage (SLiM-ID) to handle paired sequence alignments for both the motif and the PFam domain. Then, comparative modeling can be submitted and the results can be visualized on a second webpage (SLiM-IM). Models can be downloaded for further study. They can also be tagged as validated or discarded to further assist the user in defining the interaction network in the main result page. 3.4.1. Sequence to structure alignments If pre-extracted templates are available for comparative modeling of a given ELM-PFam pairing, access the sequence alignment interface, which is presented under the SLiM-ID environment (see Figure 8A). In SLiM-ID, first a summary of the paired ELM motif and the matched PFam domain sequences is highlighted on the top of the page. In addition, double alignments (of the motif and the domain sequences) with potential templates of complexes are performed using two different tools, MAFFT [43] and BLAST [44]. ELM motif and PFam domain boundaries are directly extracted from their respective databases. If needed, manually edit them to re-compute alignments with re-defined sequence boundaries (Figure 8B). The ELM motif is generally well aligned to the corresponding peptide within the template thanks to the conserved ELM signature. The PFam domain may be aligned on much more divergent templates (< 35% of sequence identity). This might indicate that the overall fold is conserved but this might not include the binding site. In case of too low sequence similarity, cautiously discard the match. In this situation, use alternative approaches to predict the desired complex (e.g.: using HADDOCK [45] pepATTRACK[46] or AlphaFold [28]). Because SLiMAn requires a perfect match between the ELM motif regular expression to detect the peptide in a template, it may sometimes miss suitable templates. Again, alternative routes to modeling are necessary in that case. To guide the selection of the most suitable templates, several alignment metrics are computed: sequence identity (%ident), query coverage (%QueryCoverage), template coverage (%TemplateCoverage) and a conserved contact score (CCS and %CCS). At any time, the alignment table can be sorted according to one of these metrics (see Note 6). In addition, to facilitate the visual inspection of the alignments, residues belonging to the peptide-protein interface are coloured (green, orange and red) according to the contact distances (of 4.0, 5.5 and 7.0 Angstroms respectively). Before launching the modelling process, select the desired entries to serve as templates for comparative modelling, according to two options: a custom selection by checking the box on the left of each alignment in the table, by using the automated selection tools, to select top 5, non-redundant PDB, or all available templates. Once, the alignments have been optimized, validated and at least one template is selected (Figure 8), click the “Launch modelling” button to start the comparative modelling process using SCWRL3.0 [47]. 3.4.2. Structure Modeling During the modeling process (approximately a few seconds per model) of the complex by SCWRL3.0, identical side-chains are kept fixed during the optimization first of the domain (in presence of the peptide from the original template), then of the peptide (in presence of the modeled domain). The completion of modelling, triggers a re-direction to the SLiM-IM environment. In the example, the 3Dmol.js viewer is used to display the complexes (Figure 9) [48]. In addition, an interaction analysis is performed by BINANA [49], highlighting favourable hydrophobic contacts (grey spheres) and hydrogen bonds (black arrows) as well as potential steric clashes (red spheres). At the bottom of the page is displayed a table containing the various information and intermediate models generated along the process. This table holds the original PDBid, its extracted SLiM-domain templates, model of the domain, model of the motif and the reconstituted complex. Click on the displayed structures to visualized or downloaded for local analysis. “Validate” or “Discard” models in the last column of the Table, based on own expertise. Click on the “Save selection” button, to erase discarded models and include validated models in the hit prediction table in SLiM-IP. The latter will be easily searchable using the “SLiMIM valid models” filter. 4. Notes 1. Pfam database is now integrated in Interpro consortium (https://www.ebi.ac.uk/interpro/). 2. SLiMAn adjusts its selection filters to bring a list of pairing neither too huge nor too tiny (if possible). These parameters should be adapted to each particular query and vary during an actual survey(see 3.3). 3. In IntAct, directional information regarding the actual bait and prey is available in most experiments. For HuRI, multiple directional two-hybrid assays can be listed as well. 4. The GSK3b docking site in AXIN1 corresponds to a folded helix with low disorder scores by IUPred and AlphaFold while the phosphorylation motif in TP53 has a high E-value (0.027). These connections would not appear with standard parameters of SLIMan (E-value 0.4). 5. As “Create your own RegEx” option offer the possibility to deal with the stringency ot a given motif, Non-canonical TBMs, R.{5}G and R.{10}G, could be also evaluated. 6. We strongly advise users to mainly focus on both the query coverage and %CCS, as the first is representing the percentage of query amino acids that will be modelled while the latter corresponds to the percentage of conserved contacts between the motif and the domain in the future comparative model. References Stumpf MPH, Thorne T, Silva E de, et al (2008) Estimating the size of the human interactome. Proc Natl Acad Sci (U S A) 105:6959–6964 Braakman I and Hebert DN (2013) Protein folding in the endoplasmic reticulum. Cold Spring Harb Perspect Biol 5:a013201 Bozaykut P, Ozer NK, and Karademir B (2014) Regulation of protein turnover by heat shock proteins. Free Radic Biol Med 77:195–209 Torres-Quesada O, Mayrhofer JE, and Stefan E (2017) The many faces of compartmentalized PKA signalosomes. Cell Signal 37:1–11 Huttlin EL, Ting L, Bruckner RJ, et al (2015) The BioPlex Network: A Systematic Exploration of the Human Interactome. Cell 162:425–440 Huttlin EL, Bruckner RJ, Paulo JA, et al (2017) Architecture of the human interactome defines protein communities and disease networks. Nature 545:505–509 Cafarelli TM, Desbuleux A, Wang Y, et al (2017) Mapping, modeling, and characterization of protein-protein interactions on a proteomic scale. Curr Opin Struct Biol 44:201–210 Ruwolt M, Piazza I, and Liu F (2023) The potential of cross-linking mass spectrometry in the development of protein-protein interaction modulators. Curr Opin Struct Biol 82:102648 Paiano A, Margiotta A, De Luca M, et al (2019) Yeast Two-Hybrid Assay to Identify Interacting Proteins. Curr Protoc Protein Sci 95:e70 Luck K, Kim D-K, Lambourne L, et al (2020) A reference map of the human binary protein interactome. Nature 580:402–408 Benz C, Ali M, Krystkowiak I, et al (2022) Proteome-scale mapping of binding sites in the unstructured regions of the human proteome. Mol Syst Biol 18:e10584 Bao Y, Pan Q, Xu P, et al. (2023) Unbiased interrogation of functional lysine residues in human proteome. Mol Cell. 83:4614-4632 Yu C and Huang L (2023) New advances in cross-linking mass spectrometry toward structural systems biology. Curr Opin Chem Biol 76:102357 Edwards AM, Kus B, Jansen R, et al (2002) Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet 18:529–536 Jansen R, Yu H, Greenbaum D, et al (2003) A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data. 302:449–453 Mayer BJ (2015) The discovery of modular binding domains: building blocks of cell signalling. Nat Rev Mol Cell Biol 16:691–698 Kumar M, Michael S, Alvarado-Valverde J, et al (2022) The Eukaryotic Linear Motif resource: 2022 release. Nucleic Acids Research 50:D497–D508 Tompa P, Davey NE, Gibson TJ, et al (2014) A million peptide motifs for the molecular biologist. Mol Cell 55:161–169 Kitamura N and Galligan JJ (2023) A global view of the human post-translational modification landscape. Biochem J 480:1241–1265 Davey NE, Travé G, and Gibson TJ (2011) How viruses hijack cell regulation. Trends Biochem Sci 36:159–169 Gemovic B, Sumonja N, Davidovic R, et al Mapping of Protein-Protein Interactions: Web-Based Resources for Revealing Interactomes. 26:3890–3910 Szklarczyk D, Kirsch R, Koutrouli M, et al (2023) The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res 51:D638–D646 Mirela Bota P, Hernandez AC, Segura J, et al (2023) CM2D3: Furnishing the Human Interactome with Structural Models of Protein Complexes Derived by Comparative Modeling and Docking. J Mol Biol 435:168055 Oughtred R, Rust J, Chang C, et al (2021) The BioGRID database: A comprehensive biomedical resource of curated protein, genetic, and chemical interactions. Protein Sci 30:187–200 Del Toro N, Shrivastava A, Ragueneau E, et al (2022) The IntAct database: efficient access to fine-grained molecular interaction data. Nucleic Acids Res. 50:D648-D653 Reys V and Labesse G (2022) SLiMAn: An Integrative Web Server for Exploring Short Linear Motif-Mediated Interactions in Interactomes. J Proteome Res 21:1654–1663 Zhou X, Hu J, Zhang C, et al (2019) Assembling multidomain protein structures through analogous global structural alignments. Proc Natl Acad Sci (U S A). 116:15930-15938 Evans R, O’Neill M, Pritzel A, et al (2022), Protein complex prediction with AlphaFold-Multimer, https://www.biorxiv.org/content/10.1101/2021.10.04.463034v2 aradi M, Anyango S, Deshpande M, et al (2021) AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res 50:D439–D444 Li X, Han H, Zhou M-T, et al (2017) Proteomic Analysis of the Human Tankyrase Protein Interaction Network Reveals Its Role in Pexophagy. Cell Reports 20:737–749 Hornbeck PV, Chabra I, Kornhauser JM, et al (2004) PhosphoSite: A bioinformatics resource dedicated to physiological protein phosphorylation. Proteomics 4:1551–1561 Shannon P, Markiel A, Ozier O, et al. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13:2498-2504. Pratt H, Weng Z. LogoJS: a Javascript package for creating sequence logos and embedding them in web applications. Bioinformatics. 2020 Jun 1;36(11):3573-3575. Mészáros B, Erdos G, and Dosztányi Z (2018) IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res 46:W329–W337 Jumper J, Evans R, Pritzel A, et al (2021) Highly accurate protein structure prediction with AlphaFold. Nature 596:583–589 Cunningham JM, Koytiger G, Sorger PK, et al (2020) Biophysical prediction of protein-peptide interactions and signaling networks using machine learning. Nat Methods 17:175–183 Guettler S, LaRose J, Petsalaki E, et al (2011) Structural basis and sequence rules for substrate recognition by Tankyrase explain the basis for cherubism disease. Cell 147:1340–1354 Koirala S, Klein J, Zheng Y, et al (2020) Tissue-Specific Regulation of the Wnt/β-Catenin Pathway by PAGE4 Inhibition of Tankyrase. Cell Rep 32:107922 Morrone S, Cheng Z, Moon RT, et al (2012) Crystal structure of a Tankyrase-Axin complex and its implications for Axin turnover and Tankyrase substrate recruitment. Proc Natl Acad Sci (U S A). 109:1500-1505 DaRosa PA, Klevit RE, and Xu W (2018) Structural basis for tankyrase-RNF146 interaction reveals noncanonical tankyrase-binding motifs. Protein Sci 27:1057–1067 Ayyappan V, Wat R, Barber C, et al (2021) ADPriboDB 2.0: an updated database of ADP-ribosylated proteins. Nucleic Acids Research 49:D261–D265 Gogl G, Jane P, Caillet-Saguy C, et al.. (2020) Dual Specificity PDZ- and 14-3-3-Binding Motifs: A Structural and Interactomics Study. Structure. 28:747-759 Katoh K, Misawa K, Kuma K, et al (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30:3059–3066 Altschul SF, Gish W, Miller W, et al (1990) Basic local alignment search tool. J Mol Biol 215:403–410 de Vries SJ, van Dijk M, and Bonvin AMJJ (2010) The HADDOCK web server for data-driven biomolecular docking. Nat Protoc 5:883–897 de Vries SJ, Rey J, Schindler CEM, et al (2017) The pepATTRACT web server for blind, large-scale peptide-protein docking. Nucleic Acids Res 45:W361–W364 Wang Q, Canutescu AA, and Dunbrack RL (2008) SCWRL and MolIDE: computer programs for side-chain conformation prediction and homology modeling. Nat Protoc 3:1832–1847 Rego N and Koes D (2015) 3Dmol.js: molecular visualization with WebGL. Bioinformatics 31:1322–1324 Young J, Garikipati N, and Durrant JD. (2022) BINANA 2: Characterizing Receptor/Ligand Interactions in Python and JavaScript. J Chem Inf Model. 62:753-760 Tables Table 1 is available in the Supplementary Files section. Additional Declarations The authors declare no competing interests. Supplementary Files FiguresMetMolBiol10.png Table 1: PhosphoSite+ filters codes Various schemes allow selection of SLiMs according to their PTM content and their requirement according to ELM. Cite Share Download PDF Status: Posted Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-3973092","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Method Article","associatedPublications":[],"authors":[{"id":273905997,"identity":"e0f2bbba-137e-4273-9220-d5858e6d122b","order_by":0,"name":"Alexandre Mezghrani","email":"","orcid":"","institution":"Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France","correspondingAuthor":false,"prefix":"","firstName":"Alexandre","middleName":"","lastName":"Mezghrani","suffix":""},{"id":273905998,"identity":"948a3f7e-7b19-4010-8a90-f6f1458a6f15","order_by":1,"name":"Juliette SIMON","email":"","orcid":"","institution":"Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France","correspondingAuthor":false,"prefix":"","firstName":"Juliette","middleName":"","lastName":"SIMON","suffix":""},{"id":273905999,"identity":"b51cf841-b532-43c9-a345-17a9f95a13b3","order_by":2,"name":"Victor Reys","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAvklEQVRIiWNgGAWjYLCCDwwHYEw54nQwzkBoMSZOCzMPSVrM23uMP9vU3LFnYD977OMPBoN8glpkzpwxk8459iyxgScveTYPg4FlAyEtEhI5Zsy5DYcTGCR4jJkZGP4YELRFQv6N8WfLhsP2IC2MQIcRoUWCx0CaseEwYwNQCwMPUVp40soke4B+aePJMWbmMSBGC/vhzR9+AEOMn/0M0GEVRGhhYOCAKGIDk8RoYGBgf0CUslEwCkbBKBjBAABG6zAsYNwHggAAAABJRU5ErkJggg==","orcid":"","institution":"Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France","correspondingAuthor":true,"prefix":"","firstName":"Victor","middleName":"","lastName":"Reys","suffix":""},{"id":273906000,"identity":"4906a15d-8ade-465d-8d01-9863741b18ef","order_by":3,"name":"Gilles Labesse","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAA0UlEQVRIiWNgGAWjYBACPgYeMC3HIAETYj6AXwsbVIsxQgtbAnFaEhuI18J+9uDDH39s0vtntz9g+Lnjnj0DG+8D/Fp48pKNedvScmfcOWPA2HumOLGBjd2AgMNyzKQZGw7nbpDIYWBmbEtIYJBvI+Aw/jfmP3/8+Z9uIJH+AKQF6DA2AlokcswYeNgOJBhIJBiAtDA2ENbyLlmaty3ZEOSXg71tCYlthLTw8+ce/Pjjj508/+z2hw9+Ah3GT0gLCjgAtpcEDaNgFIyCUTAKcAAAHVQ7chzr260AAAAASUVORK5CYII=","orcid":"","institution":"Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France","correspondingAuthor":true,"prefix":"","firstName":"Gilles","middleName":"","lastName":"Labesse","suffix":""}],"badges":[],"createdAt":"2024-02-20 15:00:42","currentVersionCode":1,"declarations":{"humanSubjects":false,"vertebrateSubjects":false,"conflictsOfInterestStatement":false,"humanSubjectEthicalGuidelines":false,"humanSubjectConsent":false,"humanSubjectClinicalTrial":false,"humanSubjectCaseReport":false,"vertebrateSubjectEthicalGuidelines":false},"doi":"10.21203/rs.3.rs-3973092/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-3973092/v1","draftVersion":[],"editorialEvents":[],"editorialNote":"","failedWorkflow":false,"files":[{"id":51526591,"identity":"4fc9ce11-3f49-43b6-a214-be6158b018c1","added_by":"auto","created_at":"2024-02-23 05:55:17","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":2565478,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003ePipeline of SLiMAn2.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe flowchart of SliMAn 2.0 is shown with the databases and tools used at the various steps. Results obtained at the various steps are also displayed here and discussed in the text.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol01.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/e9fb786496ae437f3cc1ea92.png"},{"id":51526300,"identity":"0e47c3a0-8dfe-44d3-a25c-99f3134cc8b1","added_by":"auto","created_at":"2024-02-23 05:47:17","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":1574562,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eQuery submissions.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eA) For a novel interactome: Upload (Uniprot file) or manually enter a list of Uniprot names/codes of proteins (hereafter Uniprot list) within the appropriate menu in SLiMAn frontpage (\u003ca href=\"https://sliman2.cbs.cnrs.fr/\"\u003e\u003cu\u003eF\u003c/u\u003e\u003c/a\u003e\u003cu\u003eigure 2A\u003c/u\u003e\u003ca href=\"https://sliman2.cbs.cnrs.fr/SLIMAN2/limip_index.py\"\u003e).\u003c/a\u003e\u003c/p\u003e\n\u003cp\u003eB) For a curated (meta-)interactome from biological databases: Enter a UniProt accession number (e.g.: Q9H2K2) or the corresponding UniProt identifier (here: TNKS1_HUMAN; see Figure 2B) for the requested protein in the section “PPI extension”. As presented here, known interactants of Tankyrase 1 (TNSK1) are extracted automatically from BioGRID and IntAct databases.\u003c/p\u003e\n\u003cp\u003eC) Show a list of interactants retrieved by SliMAn.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol02.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/29b50fcfd53e1a2776399736.png"},{"id":51526305,"identity":"9c44a768-53f4-48ce-894a-64b80c9b374f","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":933497,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSummary of query and results on top of the main page.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe upper part of the front page recapitulates the query name and date as well as the number of proteins under study. A table indicates the number of ELM motifs and Pfam domains detected and connected by SliMAn. A set of outputs are also listed.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol03.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/7d4732a0198f68f9a42cac07.png"},{"id":51526302,"identity":"51550866-9295-49eb-8735-74fcfdd33272","added_by":"auto","created_at":"2024-02-23 05:47:17","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":1482808,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eParameter panels from the main page.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eA-B) Left and right part of the middle section of the front page displays the various parameters and selection scheme to be used interactively to highlight the desired ELM-PFam pairings.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol04.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/f49156dc3b9fdd524c5e460a.png"},{"id":51526304,"identity":"26856c6d-ce0f-4f79-8b40-553a27f12555","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":1012502,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eExample of a main result table of SLiMAn webserver.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe ELM-PFam pairings are displayed in the main table. The color code of the confidence levels is shown above the list of ELM motifs selected by SliMAn and the corresponding PFam domain.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol05.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/90d76a6b2696d3d2b5c4931b.png"},{"id":51526303,"identity":"1a4eb94c-7f77-40e3-b7a7-0bc390ca6ca9","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":3022620,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eSpecific information (SLiMIP Hits) relative to a given PPI.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eFor each ELM-PFam pair a pop-up window shows the various specifications and correspondng parameters as well as links to dedicated PubMed searches and the corresponding information from the BIOGRID and IntAct database.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol06.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/677d82f34ada1e2465635388.png"},{"id":51526306,"identity":"2473b6e3-585e-41e1-b4a3-47251f1b544a","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":6429317,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eCytoscape view of SLiM-based PPIs in the TNKS1/2 interactome.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eExample of an output in Cytoscape highlighting the various links connecting baits and preys. SliMAn shows the annotated domains in each protein displayed and the connections corresponding to ELM motifs as well as experimental connections extracted from BioGRID and IntAct. The protein domains and the links are clickable to get more information regarding the associated information (with links to Uniprot and Pubmed for the given protein(s) and experiments).\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol07.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/91412bdba8b40850b02b53ea.png"},{"id":51526307,"identity":"003fbff6-2385-46fe-9e49-67269db1b875","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":8,"title":"Figure 8","display":"","copyAsset":false,"role":"figure","size":1102745,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eExample of template alignment for structural modeling.\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eA) Protein sequences of CASC3 and TNKS1. Extracted motif and domain sequences boundaries are highlighted in green. Note the PFam ankyrin domain segmentation (144-152) does not correspond in this case to the entire TNKS1 binding domain. B) Manual re-definition of the sequence boundary (147-447) for TNKS1 domain. The new segmentation performed by the user is notified at different levels (“User defined segmentation”, “Modify Domain Segmentation” and “Update alignment with current domain segmentation”).\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol08.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/5f082a54a1f7cd8ac30915b1.png"},{"id":51526308,"identity":"e305212b-3517-44e3-a4ad-f8d53f901f15","added_by":"auto","created_at":"2024-02-23 05:47:18","extension":"png","order_by":9,"title":"Figure 9","display":"","copyAsset":false,"role":"figure","size":5861668,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eExample of molecular modeling\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eComparative modeling of the ANK motif in complex with TNSK1 ankyrin (using PDB-5JHQ) as templates). The graphical representation is displayed in the SLiM-IM page and generated by the \u003cem\u003e3DmolJS\u003c/em\u003eapplet. Contacts analysis are detected by \u003cem\u003eBINANA\u003c/em\u003e, and displayed on the model. Side-chains within and around the bound ELM motif are shown in sticks. Favorable van der Waals interactions are shown as grey disks and hydrogen bonding as arrows. Clashes are shown as red spheres. The apparent quality of the complex shown here suggests that the divergent TBM in CASC3 fits well in the binding site of TNKS1.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol09.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/74b561727fdb3857b240cbbc.png"},{"id":51526976,"identity":"b0c0fa08-7eb9-42cb-aa92-0800e81dc701","added_by":"auto","created_at":"2024-02-23 06:03:21","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":2527740,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/e079575d-8337-4d48-bae6-63e1ea0eaf68.pdf"},{"id":51526299,"identity":"e586bf73-2efb-45b2-bab6-fc3b25a2faa3","added_by":"auto","created_at":"2024-02-23 05:47:17","extension":"png","order_by":1,"title":"","display":"","copyAsset":false,"role":"supplement","size":1810183,"visible":true,"origin":"","legend":"\u003cp\u003e\u003cstrong\u003eTable 1: \u003c/strong\u003e\u0026nbsp;\u003cstrong\u003ePhosphoSite+ filters codes\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eVarious schemes allow selection of SLiMs according to their PTM content and their requirement according to ELM.\u003c/p\u003e","description":"","filename":"FiguresMetMolBiol10.png","url":"https://assets-eu.researchsquare.com/files/rs-3973092/v1/8eef5b67aa7e0a56dceae640.png"}],"financialInterests":"The authors declare no competing interests.","formattedTitle":"\u003cp\u003eDetection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server\u003c/p\u003e","fulltext":[{"header":"1. Introduction","content":"\u003cp\u003eIn all organisms, from virus to mammals, proteins, throughout their lifespan, interact with a large number of partners, especially other proteins. The number of protein-protein interactions (PPIs) can be reasonably estimated to be in the order of a million in each eukaryotic cell [1]. At the basis of life plasticity, the vast majority of an interactome is mainly constituted of transient PPIs. For example, for the 20% of proteins that are secreted, even before they reach their proper location, they have transiently interacted with dozens of different partners along the secretion pathway, post-translationally shaping their molecular and structural characteristics [2]. Similarly, cellular protein degradation systems involve sophisticated machinery in which targeted proteins interact transiently with multiple proteins [3]. Many signaling cascades rely also on such weak PPIs [4]. As these interactions can be transient, with low affinity and often spanning a few percent of the total of gene products, they have been largely underestimated in cell biology and hard to characterize.\u003c/p\u003e\n\u003cp\u003eDuring the last two decades, substantial technical progress has been made in interactomics with high-throughput Affinity Purification Mass Spectrometry (AP-MS) as well as with large scale yeast two-hybrid system allowing the identification of around half-a-million of interactions for the ~20000 human proteins [5\u0026ndash;7]. Whereas classical AP-MS is well suited to identify high-affinity interactions involved in stable core protein complexes, the detection of true transient protein-protein interactions is more challenging even in presence of cross-linkers [8]. Another big challenge in AP-MS is discrimination between direct or indirect interactions in a proteome. In contrast, two-hybrid systems focus on binary interactions, and appear more suitable for transient protein interaction detection with the disadvantage of being reductionist and not perfectly physiological for non-yeast proteins [9]. Consequently, this method has a high rate of false positive and negative hits although careful and redundant strategy can help improving these statistics [10]. Low-throughput studies in the test tube, like complex reconstitution, are time consuming, biased, and often non-physiological but highly informative at the molecular and structural levels. New methodologies emerged to unravel undiscovered PPI such as dedicated phage-display systems [11], systematic directed mutagenesis using Crispr-Cas9 technology [12] and massive cross-linking (diazirine photo-crosslinking and MS) [13]. Despite the different limits mentioned above, systematic studies with each technique for a defined macromolecule pave the way to a complete human interactome resolution. Combining the various outputs to gain better insights is useful but can be complex despite remarkable early attempts [14, 15].\u003c/p\u003e\n\u003cp\u003eNumerous molecular and structural studies have highlighted that many PPIs involve a folded domain recognizing short linear motifs (SLiMs) [16, 17]. Several hundred of SLiMs are playing key roles in all cellular processes including cell signaling, metabolism and as well as protein trafficking [18]. SLiMs are also at the basis of all protein post-translational modifications (PTMs) like proteolysis, phosphorylation, ubiquitination, glycosylation and others, as protein-modifying enzymes like protein kinases and proteases usually recognize only a short segment of consecutive amino-acids [19]. SLiMs are often found in flexible loop regions for single globular proteins and in intrinsically disordered segments for multidomain proteins [20]. Not surprisingly, more than 20 of the 340 ELM SLiMs registered in the ELM database are localized at the N- or C-terminal protein parts (http://elm.eu.org/).\u003c/p\u003e\n\u003cp\u003eDifferent resource databases gather interactomic data [21]. Most of them, like STRING [22], CM2D3 [23], BioGRID [24] and IntAct [25] retrieve biochemical and genetic data for a given gene product to show connection to putative partners \u0026ndash; or interactants \u0026ndash; and some systems score their reliability [22]. However, in most of them, the precise molecular and structural features of each PPI are not directly accessible. Furthermore, despite the high amount of information collected, various connections are missed even in STRING, as previously noted [26]. Tools dedicated to domain-domain interactions based on comparative modeling have been described [27] but there are limits to the number of complexes one can model. However, the newly developed artificial-intelligence (AI)-based approaches, such as AlphaFold multimer, provide a promising alternative to standard homology-based modeling [28]. Furthermore, the detection and modeling of SLiM-based interactions is hampered by the elusive nature of SLiMs. The SLiMAn webserver has been developed to integrate interactomics data in order to identify SLiMs and their recognition patterns as well as to perform comparative modeling (when possible) [26]. Here, we highlight enhanced features recently added in SLiMAn2 with examples run in analytical and discovery modes.\u003c/p\u003e"},{"header":"2. Material","content":"\u003cp\u003eA primary version of SLiMAn has been already described \u003cstrong\u003e\u003cem\u003e[\u003c/em\u003e\u003c/strong\u003e26\u003cstrong\u003e\u003cem\u003e]\u003c/em\u003e\u003c/strong\u003e. Since then, the source code has been completely rewritten, new features were added and the web server rendering now fully exploit JavaScript for an enhanced interactivity and user experience. New tools or databases have been added in the version 2.0 of SLiMAn (available at: https://sliman2.cbs.cnrs.fr/ and described in more detail elsewhere (manuscript in preparation))(Figure 1). Among them, the IntAct database that brings additional experimental pairing while AlphaFold predictions \u003cstrong\u003e\u003cem\u003e[\u003c/em\u003e\u003c/strong\u003e29\u003cstrong\u003e\u003cem\u003e]\u0026nbsp;\u003c/em\u003e\u003c/strong\u003e(through the huge database of systematically pre-computed structural models) increase confidence in the boundaries of folded and disordered regions in each protein (Figure 1). In addition, fast connection to PubMed \u0026nbsp;accelerate screening of published material for a given pairing or protein (http://www.ncbi.nlm.nih.gov/pubmed). Other additional features are also described along their use in this chapter (Figure 1).\u003c/p\u003e"},{"header":"3. Methods","content":"\u003cp\u003e3.1 Query types.\u003c/p\u003e\n\u003cp\u003eSLiMAn can be interrogated in two ways depending on the data available. Either a given interactome has been obtained (or selected from any source of data) and the corresponding list of proteins can be directly submitted to the webserver or the submitted entry corresponds to only one unique protein. In the latter case, SLiMAn interrogates BioGRID and/or IntAct to retrieve a (meta-)interactome. This constitutes a list of putative interactants to be submitted directly and analyzed using the very same webserver. Once the query list is submitted, the same workflow applies to the data regardless of their origin (Figure 1). In the case an original interactome is submitted, the data from BioGRID and/or IntAct can serve as a validation, as illustrated previously [26] and further in this protocol.\u003c/p\u003e\n\u003cp\u003eTo illustrate the different properties of SLiMAn2 for interactomic data analysis, a focused proteomic study on tankyrase 1 and 2 published by Li and colleagues in 2017, name hereafter the TNKS1/2 interactome is proposed [30]. The information extracted from this analysis is compared to the one resulting from performing a parallel analysis of the meta-interactome extracted from BioGRID and IntAct for the same two tankyrases. The goal is to define the molecular features of the different PPIs present in the interactomes from distinct sources and to reveal potential SLiM-based interactions involved in different functional networks. We describe below the step-by-step method used to partially decipher a SLiM-based protein network.\u003c/p\u003e\n\u003cp\u003e3.1.1. Querying a novel interactome\u003c/p\u003e\n\u003cp\u003e1. Collect identifiers (or accession codes) from Uniprot for all the partners of the TNKS1/2 \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp;interactome.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e2. \u0026nbsp; Start a new project on the webserver by submitting the list of names \u0026ndash; separated by commas \u0026ndash; on, the front page (Figure 2A). A project name can be input in the dedicated window.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e3. \u0026nbsp; Press the button \u0026ldquo;Find Interactions\u0026rdquo; just below and SLiMAn will start its analysis (see below 3.2).\u003c/p\u003e\n\u003cp\u003e3.1.2. Quick start from a given protein.\u003c/p\u003e\n\u003cp\u003eAlternatively:\u0026nbsp;\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eSubmit the Uniprot names for TNKS1 or 2 (TNKS1_HUMAN or TNKS2_HUMAN, respectively) to the menu \u0026ldquo;PPI Extension\u0026rdquo; on the SLiMAn front page (Figure 2B). In this case, SLiMAn queries BioGRID and IntAct that contain curated protein interactomic data for most model organism species.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003ePress \u0026ldquo;Find Interactants\u0026rdquo; and SLiMAn-2 will retrieve all the interactomic data from these two databases (see 3.1.3).\u0026nbsp;\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003e3.1.3 Filtering putative interactants\u003c/p\u003e\n\u003cp\u003eFrom a given UniProt protein name (or code), SLiMAn rapidly extracts the putative partners listed in BioGRID and/or IntAct. A resulting webpage interactively enlists, by default, the proteins associated with the query in any of those two databases. This list can range from very few to up to a thousand of protein partners. SLiMAn can manage more than one hundred of interactors but not a thousand in the current version. However, it is likely too difficult to survey too many partners at once. Hence, the number of interactants to study, can be filtered using the \u0026laquo;Parameters\u0026raquo; section shown just above the protein list. Data from only one database can be used instead or, on the contrary, one can focus on the intersection of the two databases, BioGRID and IntAct, for higher confidence. Note that there is a significant overlap between the two databases and in that case, this redundancy cannot be seen as a cross-validation. Otherwise, as the data from BioGRID are separated in low- and high-throughput classes while IntAct data are split in general data and those from HuRI (http://www.interactome-atlas.org/; [10]), one can reduce further and tune the query list before submission. For example, only low-throughput data from the BioGRID database can be selected or, instead, only HuRI proteomic data (direct PPIs) within IntAct data. For higher confidence, filtering in interactants detected by both low- and high-throughput methods and present in both databases, is a good choice, although at the expense of the number of partners. Each time, SLiMAn updates the list of proteins to survey. One can hoover the mouse on the query name to read the number of interactants enlisted.\u003c/p\u003e\n\u003cp\u003eOnce parameters are chosen, press the button \u0026ldquo;Quick launch\u0026rdquo; to launch SLiMAn analysis.\u003c/p\u003e\n\u003cp\u003e3.2 Main outputs.\u003c/p\u003e\n\u003cp\u003eWhatever the type of data (provided list of protein or meta-data extracted for one given protein), SLiMAn will search for ELM motifs as well as the corresponding PFam domains (see Note 1) within the submitted protein sequences. To filter only relevant information, orphan or unpaired ELM motifs and PFam domains are discarded. This dramatically reduces the list of ELMs amenable to analysis, in contrast to a direct interrogation of the ELM database. The motifs and domains filtered in, are shown on the main table for further interactive analysis. This table highlights all the putative pairing between an ELM motif and the corresponding domain. Numerous parameters can be used to filter in or out, more or less protein pairs as illustrated below.\u003c/p\u003e\n\u003cp\u003e3.2.1. General view.\u003c/p\u003e\n\u003cp\u003eIn its upper part, the main result page recapitulates the query parameters (query name, number of partners, \u0026hellip;). After an automatic setting of the filters (see Note 2), \u0026nbsp;it tabulates the number of ELM motifs, PFam domains and number of proteins containing (or not) such motifs and/or domains (Figure 3). It also provides links to useful outputs for subsequent studies.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn the central part of the webpage, eight modules of different filters are present with the corresponding buttons, cursors or digits for selection (Figure 4). The \u0026ldquo;ELM\u0026rdquo;, \u0026ldquo;Disorder\u0026rdquo;, \u0026ldquo;HSM\u0026rdquo; and \u0026ldquo;PSP+\u0026rdquo; filters allow to manage structural and molecular parameters of PPis (Figure 4A). The second part of the filter panel is dedicated to the selection (\u0026ldquo;PPI database\u0026rdquo;, \u0026ldquo;SLiMan\u0026rdquo; (level of confidence),\u0026rdquo;visual\u0026rdquo;) and display forms (\u0026ldquo;visual\u0026rdquo;) of PPis (Figure 4B). Several examples of the roles of this interface in interactive analysis of an interactome is devellop in more details below (see 3.3).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn the lower part, a table is provided in which the ELM-PFam pairing computed by SLiMAn are highlighted. In the top row, PFam domains are listed associated with the UniProt identifier of the proteins containing them. In the left column, SLiMs corresponding to ELM entries, are listed in association with the UniProt identifier of the proteins containing them. Molecular features for each SLiM are detailed, namely, its ELM class, the corresponding regular expression and sequence motif as well as its location in the studied protein. If a PTM is annotated in the motif, as extracted from PhosphoSitePlus (herein PSP+; [31]), it is also highlighted (see below for more details).\u003c/p\u003e\n\u003cp\u003eFor each ELM-PFam pair highlighted by SLiMAn, a box is drawn and colored according to the corresponding confidence level. It contains three links and a check box. The latter can be used to select a pairing and collect the corresponding validated pairs for subsequent visualization in a table or a Cytoscape network (see below) [32]. The \u0026ldquo;Hit\u0026rdquo; link (upper right panel) opens up a pop-up window recapitulating all the parameters computed for the match (see Figure 5 and following sections for further details). The lower left box is a link to an alignment module and the possible launch of comparative modeling of the putative complex that the ELM motif and the matched domain could form (see 3.4). When actual modeling within the framework of SLiMAn is performed, a new link is created in the lower right box of the main result page. These additional steps may help assessing the likelihood of a pairing.\u003c/p\u003e\n\u003cp\u003eFinally, below each column in the main table, a consensus sequence is computed and highlighted using LogoJS [33] for each ELM motif type paired with a given PFam domain. A selection button enables switching from one ELM type to another (e.g.: LIG_SH3_1 to LIG_SH3_9).\u003c/p\u003e\n\u003cp\u003eModifying thresholds for the various parameters triggers a new computation resulting in a new table of pairings and new logos.\u003c/p\u003e\n\u003cp\u003e3.2.2. Specific information for each ELM-PFam pair.\u003c/p\u003e\n\u003cp\u003eIn each ELM/PFam pair box within the main table, pressing the \u0026ldquo;Hit\u0026rdquo; button grants direct access to information regarding a given putative interaction in a dedicated pop-up window (Figure 4). This window details the ELM entry (motif name and E-value, sequence boundaries and whether or not it is part of the validated instance according to ELM), the experimental data extracted from BioGRID and IntAct databases (indicating the number of hits and providing links to the associated publications in PubMed), key biophysical parameters from IUPred2A [34] and AlphaFold [35] as well as the pairing likelihood score computed by HSM [36].\u003c/p\u003e\n\u003cp\u003eThe IUPred2A section highlights the different scores (ANCHOR, IUPred local and global disorder or domain scores) for protein disorder predictions [34]. SLiMAn2 also provides the pLDDT score from pre-computed models stored in the AlphaFold database (AFDB) [29]. This score estimates the local accuracy, and was observed to nicely correlate to local disorder computed by IUPred. HSM is a dedicated predictor of interaction likelihood (from 0 to 1) for 6 types of recognition domains (PDZ, SH2, SH3, WW, WH1 and PTB) [36].\u003c/p\u003e\n\u003cp\u003eThe number of experiments corresponding to a given pair of proteins within BioGRID and IntAct are split in categories (e.g.: Low vs High throughput methods for BioGRID data, as discuss above). Select \u0026ldquo;toggle details\u0026rdquo; to see the precise type of experiments used, the date of deposit in the database (for IntAct data), a reference in PubMed as well as a link to the associated publication (as extracted from BioGRID and IntAct information). Additional technical information from IntAct and HuRI are also displayed (see Note 3).\u003c/p\u003e\n\u003cp\u003eIn the upper-right corner of the pop-up window, \u0026ldquo;PubMed queries\u0026rdquo; links are provided to search PubMed for publications describing each protein as well as those associating the two proteins of that pair (Figure 6). The latter corresponds to a search in PubMed combining all the alternative identifiers, accession and gene names for each protein in the pair, using logical operators. It enables quick access to related research articles found in literature and can compensate the lack of deposition in PPI repositories.\u003c/p\u003e\n\u003cp\u003e3.3 Hands on SLiMAn interactivity\u003c/p\u003e\n\u003cp\u003eThe main page of results shows a subset of putative pairings extracted from the query list with a set of parameters tuned automatically (Figure 5). This works well with most queries, highlighting some ELM motifs but hiding many others due to current parameters such as, a too stringent (= small) E-value, or restrictive disorder parameters. The interactive fine-tuning of parameters supports the identification of promising pairings and the detection of direct interactions through a given motif and a corresponding domain.\u003c/p\u003e\n\u003cp\u003eAdditional analysis can be also initiated from this webpage such as sequence alignment and comparative modeling of motif-domain complexes. This part is described in more detail at the end of this chapter as it may require some expertise in structural biology to be easily handled and truly fruitful. But most of the analyses using SLiMAn rely on the frontpage.\u003c/p\u003e\n\u003cp\u003e3.3.1 Parameter and filter description\u003c/p\u003e\n\u003cp\u003eWe now briefly define the different filters available, while their specific use is illustrated in the next section.\u003c/p\u003e\n\u003cp\u003eKnown PPIs can be selected from BioGRID or IntAct similarly to the precedent step (3.1.3). A heuristic scale (1-8) of confidence, combining several criteria for a SLiM-based PPI (predicted biophysical properties, experimental evidences...), was set to allow the user to quickly select PPIs with the highest level of confidence. Several types of filters can be applied :\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003e\u0026ldquo;ELM\u0026rdquo; and \u0026ldquo;PSP+\u0026rdquo; filters are defined respectivley to:\u003col style=\"list-style-type: lower-alpha;\"\u003e\n \u003cli\u003eSet the upper bound threshold for the ELM motif E-value, set ELM class of SLiMs (cleavage, modification, targeting, degradation, docking, ligand) or the ELM validated instances.\u003c/li\u003e\n \u003cli\u003eFilter putative pairings based on the presence/absence of a given posttranslational modifications (PTM) and its requirement for the motif to be functional. As several SLiMs contain one or more PTMs, corresponding experimental information within each motif is made available through a link to the PhosphoSitePlus database (https://www.phosphosite.org/homeAction ).\u003c/li\u003e\n \u003c/ol\u003e\n \u003c/li\u003e\n \u003cli\u003eValues of structural parameters \u0026nbsp; (IUpred2A, AlphaFold) or pair likeliness (HSM) can be set and tuned to refine displayed SLiMAn predictions of PPis.\u003c/li\u003e\n \u003cli\u003eText filters are applicable to limit the analysis to a given type of motif, domain or protein. To that end, the corresponding ELM or PFAM expression or the protein name can be input in the visual toolbox (e.g.: SH3_1 in the ELM box or SH3 in the PFam box).\u0026nbsp;\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003e3.3.2 Consulting validated instances of SLiM-based PPIs\u003c/p\u003e\n\u003cp\u003e\u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; The use of these different filters for interactomic data analyses is illustrated with a step-by-step and hierarchical approach which gradually defines the molecular features of the different PPIs found in a published interactome of the two human tankyrases (130 proteins) [30], members of Poly(ADP-Ribose) Polymerase proteins (PARPs) family, and their respective meta-interactomes from BioGRID-IntAct (170 and 80 interactants for TNKS1 and 2). The final results for those three searches in SliMAn can be found on the webserver (https://sliman2.cbs.cnrs.fr/study/TNKS1-2-Inter.html ; \u0026nbsp;https://sliman2.cbs.cnrs.fr/study/TNKS1-Meta.html ; https://sliman2.cbs.cnrs.fr/study/TNKS2-Meta.html ). This approach aims at revealing potential SLIM-based interactions from the most likely to the less convincing ones while building a hierarchical molecular network.\u003c/p\u003e\n\u003cp\u003eStep 1: Identification of the most likely PPIs\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eRequest \u0026ldquo;Display all\u0026rdquo; within the Visualization panel.\u003c/li\u003e\n \u003cli\u003eSelect the ELM valid instances without applying any filter (i.e.: select \u0026ldquo;Validated Instances\u0026rdquo; within the ELM panel and switch off other parameters).\u0026nbsp;\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eFor the combined interactome TNKS1/2 [30], SLiMAn2 shows that 4 proteins would interact directly with tankyrase 1. These ELM valid instances involve a SLiM motif named Tankyrase Binding Motif (hereafter TBM) found in AXIN1, FNBP1, TERF1 and CASC3 and recognized by several ankyrin repeats (ARC or ANK) of TNKS1 and TNKS2 (Figure 5A) [37]. We can observe that these TBM motifs are predicted to be in unstructured regions (pLDDT \u0026lt; 60) and prone to fold upon binding (ANCHOR2 score \u0026gt; 0.4).\u003c/p\u003e\n\u003cp\u003eAt this stage of analysis, no other type of SLiM-based PPIs appears to involve TNKS1 or TNKS2 while two other PPIs are revealed as ELM instances between GSK3b-AXIN1 and GSK3b-TP53. Here, two distinct linear motifs in AXIN1, its TBM (21-28) and its GSK3b docking site (383-389) would bridge GSK3b and TP53 to the tankyrases (see Note 4).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eNote that within the BioGRID/IntAct meta-interactomes, 4 ELM instances (TBM detected in AXIN1, FNBP1, TB182 and TERF1) are found for TNKS1 and none for TNKS2. Furthermore, it shows only a partial overlap with the TNKS1/2 interactome with 3 common interactants (out of 5 in total).\u003c/p\u003e\n\u003cp\u003eStep 2: Detection of highly-confident SLiM-based PPIs\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eSwitch off the ELM instances and set the level of confidence to 8 with a low E-value (0.005). This increases the number of proteins displayed from 6 to 19 among 128 preys for TBNKS1/2. Fourteen proteins harbor ELM motifs putatively recognized by 7 PFam domains, which include 4 protein-kinase catalytic domains and 2 FHA domains on top of the Ankyrin repeats from the two tankyrases. Here, the ELM validated instance for CASC3 with TNKS1 is filtered out as no experimental evidence is recorded in BioGRID (requested Bio total \u0026gt; 0) or IntAct (requested IntAct+HuRI \u0026gt; 0) for that pair.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eSwitch on the disordered parameters (Anchor \u0026gt; 0.4; Short and long Disorder \u0026gt; 0.4; pLDDT score \u0026lt; 60) to filter out a few motifs (15 out 202) and keep only most likely ones. \u0026nbsp;\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eAt this level of confidence and filtering, two additional substrates of human tankyrases appear (BABA1 and GO45) and they are connected to no other proteins. AXIN1 is still connected to tankyrases as well as to GSK3s and therefore TP53. The table highlights three other sub-networks, one corresponding to a multimer of the protein-kinase Chk2 (through its FHA motif and domain), one connecting MRE11 and nibrin (again through an FHA pair) and a last one connecting the protein-kinase STK26 to STRN4, STRP1, and CT2NL, due to various phosphorylation motifs and multiple experimental evidences from BioGRID and IntAct.\u003c/p\u003e\n\u003cp\u003eAt a similar level of confidence and filtering, 10 PPIs involving 11 proteins are highlighted for the meta-interactome of TNKS1 whereas no additional PPIs was obtained for TNKS2. Interestingly, 5 proteins (AXIN1, FNBP1, GO45, TERF1 and BABA1) supposedly interacting with TNKS1 are common to the BioGRID/IntAct meta-interactome and the TNKS1/2 interactome.\u003c/p\u003e\n\u003cp\u003eAs illustrated with this example, SLiMAn facilitates the identification of both direct and indirect connections or possible ternary complexes. At such a stringent filtering, interactions or pairings predicted by SLiMAn merely match already well-known interactions. However, lowering the stringency, may result in too many pairings for simultaneous inspection.\u003c/p\u003e\n\u003cp\u003e3. Use text-based filtering to focus on one given type of pairings:\u0026nbsp;\u003c/p\u003e\n\u003col style=\"list-style-type: lower-alpha;\"\u003e\n \u003cli\u003eInput PFam query: TNKS\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eInput ELM query: DOC_ANK_TNKS_1\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eto display only the pairings involving Tankyrases and the DOC_ANK_TNKS_1 motif\u003c/p\u003e\n\u003cp\u003eThis selection leads to smaller table with one partner for TNKS2 and 6 for TNKS1 within the TNKS1/2 interactome. A similar trend is observed in the TNKS1 and TNKS2 meta-interactomes (with 6 interactants for TNKS1 and one for TNKS2).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e4. Select\u0026nbsp;\u003c/p\u003e\n\u003col style=\"list-style-type: lower-alpha;\"\u003e\n \u003cli\u003eLower the confidence level to 6: three more TNSK2 partners (BCR, 3BP2 and TERF1) appear and only one (3BP2) for TNKS1 within the TNKS1/2 interactome, while up to 13 partners are found in the meta-interactome of TNKS1 and 8 for TNKS2. Among the 8 tankyrase binders within TNKS1/2 interactome, 6 are found also in the meta-interactomes of TNKS1 or TNKS2, and 3 of them (TERF1, 3BP2, BABA1) are shared by the three. This is still representing a tiny portion of all the preys listed in the various studies using human tankyrases as baits. This suggests that more pairings to the tankyrases may have to be characterized (or not) through SLiMAn interface by navigating at much lower stringency.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eFilter for predicted disordered using the above threshold remove only one validated partner (PAGE4) in TNKS1 meta-interactome [38], which can be brought back by increasing the AlphaFold pLDDT threshold to 65 (instead of 60). This pre-filtering analysis indicates the disorder parameters to select TBM from various Tankyrase partners.\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eStep 3: Using biophysical filters to predict additional binders at low levels of confidence.\u003c/p\u003e\n\u003cp\u003eAs low confidence level can correspond to low disorder predictions and/or too few experimental evidences, one might want to counterbalance the low overall stringency by using parameters adapted to the particular pairings under scrutiny. As the TBM SLiM in ELM (DOC_TNKS_1) corresponds to highly flexible sequences, one can use rather stringent biophysical and structural features. Hence,\u0026nbsp;\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eSet IUPred2A and AlphaFold filters with high values (Anchor \u0026gt; 0.4; Short Disorder \u0026gt; 0.4; pLDDT score \u0026lt; 65).\u0026nbsp;\u003cbr\u003eThese values were derived from those observed for the TBM detected at high confidence level (first 8 and then 6; see above). This should allow us to dig into the (meta-)interactomes in a discovery mode and to spot more tankyrase partners actually bound through a TBM.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eDecrease the level of confidence to 4 (from 6).\u0026nbsp;\u003cbr\u003eThis \u0026nbsp;reveals 21 putative interactors among the 128 preys (16%) of the TNKS1/2 interactome and 37 (out of 170, 22%) in the case of the TNKS1 meta-interactome and 24 proteins (out of 80, 30%) for TNKS2. Of note, 11 of those binders are potential new PPIs, whereas 10 are shared between TNKS1/2 and the two meta-interactomes and only 6 are found in the three interactomes.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eDecrease the level of confidence to \u0026nbsp;2 (from 4).\u003cbr\u003eElven additional potential partners show up for the TNKS1/2 interactome bringing the total number of potential partners to 32. The low confidence scores (2) for most of those additional pairs (8/11), come from the lack of supporting experimental data within BioGRID and/or IntAct database, as the thresholds for disorder are stringent (and yield a confidence score of 2 by themselves). 14 binders are found in TNKS1/2 and the meta-interactomes, whereas 18 are new PPIs, and 7 are common to the three interactomes.\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eIt should be noted that a relatively small overlap is also observed between the two BioGRID/IntAct meta-interactomes with only 20 common proteins among the 62 potential TBM-dependant tankyrase binders.\u003c/p\u003e\n\u003cp\u003eAt first, these additional partners are questionable, as most of these new preys were obtained by only one independent experiment. Accordingly, they could need additional validations to ensure they indeed correspond to direct binding to one of the two tankyrases. Here, SLiMAn allows to point out which protein within the whole interactome, and which region in these proteins to prioritize in order to confirm this pairing.\u003c/p\u003e\n\u003cp\u003eStep 4 : \u0026nbsp;Adding alternative motif sequences in SLiMAn\u003c/p\u003e\n\u003cp\u003eThe relatively small number of tankyrase preys detected as direct binders, so far, indicates that other interactions are possibly still missed even at low stringency. Such failures could be due to domain-domain interactions (that cannot be shown explicitly by SLiMAn), to indirect interactions (see above and below), to the presence of divergent TBM or direct interactions mediated by other type of motifs and associated domains. The latter case is very likely as TNKS1 harbors several proline-rich SH3 binding motifs. Beside FNBP1 which possesses a SH3 domain but is also harboring a TBM, 3 nexins found in the TNKS1/2 interactome, do possess functional SH3 domains, while we detected no other connections to tankyrases otherwise. TNKS1 also harbors a PP2B docking motif and an FHA recognition motif. However, the latter is not phosphorylated according to PSP+ and, therefore, may be considered as not functional (see below).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eBut one cannot exclude that the ELM motif is defined with a too stringent sequence signature. In fact, alternative motifs have been described in several substrates of tankyrases [39, 40]. Different from the stringent canonical TBM signature (DOC_ANK_TNKS_1: .R..[PGAV][DEIP]G.), \u0026nbsp; the closely related (.R...[PGAV].G.) corresponds to a second motif with one additional residue within the same interacting partners [40].\u003c/p\u003e\n\u003cp\u003eAccordingly, search for potential alternative motifs that could fit into the TBM binding groove.\u0026nbsp;\u003c/p\u003e\n\u003col style=\"list-style-type: lower-alpha;\"\u003e\n \u003cli\u003eSurvey the crystal structures of tankyrase bound to various peptides\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eDig into the literature about tankyrase interactions.\u0026nbsp;\u003cbr\u003eStructural studies corroborated by directed mutagenesis and affinity measurements, point to the importance of an acidic residue in +2 position of the strictly conserved glycine [37]. These alternative TBMs may possess the new signatures R.{2,3}[PGAVSCT].G.[DE] or R.{3,4}[NDQEIVPT]G.[DE].\u003c/li\u003e\n \u003cli\u003eUse the \u0026ldquo;Create your own RegEx\u0026rdquo; option to manually add a new signature to the initial query step in order to screen for additional Tankyrase substrates. \u0026nbsp;\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eAdd the three patterns named respectively: Alt1 (.R...[PGAV].G.), Alt2 (R.{2,3}[PGAVSCT].G.[DE]) and Alt3 (R.{3,4}[NDQEIVPT]G.[DE]).\u0026nbsp;\u003cbr\u003eIn the TNKS1/2 interactome, the addition of alternative patterns combined with the canonical ELM-Ankyrin signature increases the number of potential Tankyrase interactors from 32 to 46 (for Alt-1), to 41 (Alt-2) and 41 (Alt-3), respectively.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eCompare, in this particular case, the enrichment levels for ADP-ribosylated proteins to evaluate each signature (i.e.: presence of the protein in the ADPriboDB 2.0 database [41] ).\u0026nbsp;\u003cbr\u003eThe proportion of ADP-ribosylated proteins increases from 48 % without filtering (complete interactome) to 65 % (ELM motif), 63% (Alt2) and 84% (Alt3) for each single filtering but for Alt1 (with only 43% of modified proteins). The best enrichment level (82 %) is obtained when combining the alternative sequence motifs Alt2 and Alt3 to the ELM canonical signature, which filter in 45 tankyrases substrates. Among these additional tankyrase partners, several were validated by low-throughput experiments (e.g.: 3BP2, 3BP5, RNF146). It also identified alternative TBM such as the second functional TBM in Pex14.\u0026nbsp;\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eThese results suggest considering alternative motifs for tankyrase recognition.\u003c/p\u003e\n\u003cp\u003eStep 5: Selection by ELM classes of SLIMs\u003c/p\u003e\n\u003cp\u003eTo focus or hierarchize the search for other motifs, SLiMAn2 also offers the possibility to analyze PPIs for each ELM class type with variable E-values. This mode is quite convenient as it reduces the size of the table of ELM-PFam pairing. The rational for filtering by ELM class type is also based to the different intrinsic properties of the PPIs. Indeed, SLiMs leading to the most stable PPIs (e.g.: SH3) are mainly presented in \u0026ldquo;Docking\u0026rdquo; (DOC) and \u0026ldquo;Ligand\u0026rdquo;(LIG) class types whereas more transient SLiM-based PPIs are found in \u0026ldquo;Modification\u0026rdquo; (MOD), \u0026ldquo;Cleavage\u0026rdquo;(CLV) and \u0026ldquo;Targeting\u0026rdquo; (TRG). In addition, different SliMs have distinct tendencies for disorder and for folding upon binding.\u003c/p\u003e\n\u003cp\u003e1. Lower the confidence level from 8 to 2 to search for other likely direct PPis of Tankyrases in the TNKS1/2 interactome.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eSimilar disorder parameters than for the TBM-Ankyrin PPI were used but other filtering can be also set for each ELM-PFam pairs. For some DOC ELM classes (PDZ, SH3, SH2), SLiMAn integrates HSM biophysical prediction, enhancing filtering options [36].\u003c/p\u003e\n\u003cp\u003e2. Filter by name with \u0026ldquo;SH3\u0026rdquo; the two well-known interaction motifs of TNKS1.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIn fact, the high-confidence interaction of FNBP1 with TNKS1 does not involve a TBM but a SH3 polyproline motif. Other SH3-based PPIs have lower levels of confidence (5) corresponding to three syntaxins (SNX9, 18 and 33).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e3. Use HSM filters to rank the multiple pairing through SH3 motifs.\u003c/p\u003e\n\u003cp\u003ePrecisely, 14 proteins are found to potentially recognize 13 SLiMs in TNKS1, from three different classes (LIG, DOC and MOD) and localized in three N-terminal highly disordered regions (1-10; 24-83 and 145-166). By filtering for LIG and DOC ELM class types, it remains 10 potential interactors with FHA (KIF1a, KIF1b, NBN, SLMAP, CHK2) and the already mentioned SH3 (FNBP1, SNX9, SNX18 and SNX33) as well as Metallophosphoesterase (MRE11) domains. From them, KIF1b, MRE11 are already directly connected via TBM motifs to the tankyrases. Of note, apart from FNBP1, none of these potential TNKS1/2 binders are present in TNKS1 or TNKS2 meta-interactomes. However, a favorable SH3 based PPI is also predicted in the TNKS1 meta-interactome between TNKS1 and UBS3B. For TNKS2, similar parameters reveal no direct SLiM-based PPI in the TNKS1/2 interactome as well as in the TNKS2 meta-interactome in agreement with the lack of disordered N-terminal part compared to TNKS1.\u003c/p\u003e\n\u003cp\u003eAfter similar step-by-step selections for the other SLiM class types (TRG, DEG, CLV), 17 new PPIs, composed of 4 direct SH3 PPIs with TNKS1 and 13 indirect (1 LIG, 2 DEG, 10 TRG) complete the TNKS1/2 interactome. Overall, 10 new direct or indirect SLIM-based partners have been added to the network on top of 42 proteins.\u003c/p\u003e\n\u003cp\u003eStep 6: PTMs and recognition of MOD class SLiMs\u003c/p\u003e\n\u003cp\u003eSLiMs and PTMs are tightly interconnected, although some protein modifications may occur due to chemical reactants with little site specificity (but for the modified residue) such as sulphur oxidation. By essence, most PTM sites should be associated with a SLiM, although not all have been precisely defined already [19]. Only a subset has been written in the \u0026quot;Modification\u0026quot; (MOD) class in the ELM database. These SLiMs are recognized by enzymes most likely through a transient interaction leading to the modification of one residue. Because of the transient nature of these interactions, we may not expect to detect them with most techniques dedicated to interactomics studies. Nevertheless, these modifications are often of uttermost importance for the functioning of macromolecules and need to be identified. Therefore, other validation schemes are required.\u003c/p\u003e\n\u003cp\u003eSLiMAn highlights the residue that should be modified for a given MOD motif. It also highlights any residue if a PTM has been annotated in the PSP+ database (for a small set of model organisms including mainly human and two rodents). A color code and a filtering scheme were set in the new version of SLiMAn (PTM observed or not) to ease the selection of the most favorable MOD SLiMs (Table 1).\u003c/p\u003e\n\u003cp\u003eUnfortunately, while ELM precisely defines the enzyme involved in those modifications (e.g.: MOD_CK2_1), the associated PFam domain comprises a large set of related proteins (e.g.: PF00069 and PF07714 for the majority of protein-kinases). Accordingly, SLiMAn is misled and frequently connects a motif with various enzymes for the same functional class ignoring the actual specificity, as illustrated below. This pairing should be cautiously considered when listed in a SLiMAn output.\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eFilter for MOD by setting disorder: Anchor \u0026gt; 0.4; Short and long Disorder \u0026gt; 0.4; pLDDT score \u0026lt; 65), 16085 PPIs at confidence level 2 are highlighted in the TNSK1/2 interactome. Most E-values are weak (\u0026gt;0,01) and correspond to a high frequency motif sequences.\u003c/li\u003e\n \u003cli\u003eFilter with PSP+ to select motifs for which the critical PTM has been experimentally detected (Table 1). The number of predicted pairings is 5735 among which 244 are supported by experimentally observed PPIs in BioGRID and/or IntAct database. These 244 PPIs involve 5 protein-kinases (TAOK2, CHK2, STK26 and GSK3Aa\u0026nbsp;and GSK3B)b\u0026nbsp;and 21 substrates (containing multiple motifs). In comparison, for the TNKS1 meta-interactome, similar filtering leads to 746 PPIs at confidence level 2 supported by 6 enzymes (STK11, STK36, TINIK, TITIN, PTEN and M4P4) and 28 substrates. For the TNKS2 meta-interactome, 3 kinases (PTEN, STK11 and MK01) and 8 substrates are potentially involved in 102 PPis. Whereas AXIN1, a well-known substrate of tankyrases, is present in the three interactomes, GSK3 kinases are however surprisingly absent, that is probably due to the higher prevalence of direct PPIs in the meta-interactomes, at least for the tankyrases. This observation is also supported by a very low number of indirect PPIs (only 2 for TNKS1) that have been found for the two tankyrases meta-interactomes.\u003c/li\u003e\n \u003cli\u003eUse PSP+ information for additional filtering. Each motif should be scrutinized by navigating between SLiMAn and PSP+ database, which is easily accessible for each annotated motif PTM. Using PSP+ indications, two GSK3 phosphorylation sites on MCL1, already link to the tankyrases, can be validated. Indeed, a transient protein complex might bring together MCL1, tankyrases, AXIN1 and GSK3 kinases. Similarly, TP53 phosphorylation by CHK2 (T18) appear also highly likely.\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eAnother illustration for the usefulness of these tools is the highly sophisticated scenario that links GSK3b to AXIN1, that can be anticipated with SLiMAn2. The prior link of GSK3b involved a previously selected docking SLIM-based PPI =. Furthermore, AXIN1 phosphorylation by GSK3b is itself phospho-dependent as the motif must be primed. Despite its low e-value (0,026), the corresponding motif can be validated as information from PSP+ confirms that the motif is phosphorylated at the two required positions (S75 and T79).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eOverall, 10 MOD PPIs, involving 4 kinases and 5 substrates, have been selected in the TNKS1/2 interactome. \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp; \u0026nbsp;\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eStep 7: Using PSP+ to filter other PTM-dependant PPis.\u003c/p\u003e\n\u003cp\u003eSeveral proteins (Kif1a, Kif1b, CHK2, NBN and SLMAP) in the TNKS1/2 interactome contain an FHA domain which recognizes a phosphorylated threonine in the LIG_FHA motif. As the e-value of the ELM FHA motif is quite high (\u0026gt;0,005), SLiMAn predicts a high number (3505) of putative FHA_1 based PPIs that can be further select :\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eSwitch off all ELM classes but the LIG one.\u003c/li\u003e\n \u003cli\u003eSelect motifs harboring a modified residue with PSP+ (Table1).\u003cbr\u003eThe number of SLiMs drops to 265 comprising 155 mono-modified and 110 multi-modified motifs.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eApply \u0026nbsp;BioGRID/IntAct databases filters\u003cbr\u003eThe number of potential FHA_1 motifs involved in known PPIs decreases to 24 (14 mono and 10 multi-modified motifs).\u003c/li\u003e\n \u003cli\u003eUse PSP+ information for additional filtering\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eAfter a survey of these particular PTM-modified SLIMs, a total of 18 proteins, not interacting by a TBM, can be linked to tankyrases for the TNKS1/2 interactome. Interestingly, 61 % (11/18) of these additional partners are ADP-ribosylated suggesting that they do belong to the TNKS1/2 interactome.\u003c/p\u003e\n\u003cp\u003eStep 8: PTMs modulating SLiMs-based PPIs\u003c/p\u003e\n\u003cp\u003eBeside the MOD class (see step 6), all the other ELM classes may contain motifs involving modified residues. The modification can be mandatory for the recognition (designated as primary/mandatory) such as the phosphorylation of a tyrosine for a SH2 motif or a threonine for a FHA motif (see step 7). Alternatively, it may not be required (designated as secondary/accessory), although it may still interfere with any binding event. Some secondary modifications have been shown to be important functional switches, as mandatory ones are, but secondary PTM can be neutral, favorable or unfavorable to binding [42]. \u0026nbsp;\u003c/p\u003e\n\u003cp\u003eAccordingly, it is important to discriminate these two types of modifications depending on the motif under scrutiny. Hence, SLiMAn indicates, like for the MOD SLiMs, the PTM required for a given ELM motifs but also those detected in the PSP+ database. The color code and a filtering scheme differentiates the various situations (PTM observed or not; required or not), in support of more \u0026nbsp;accurate searching for important motifs requiring PTMs or harboring secondary switches.\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eSwitch off all ELM classes but the LIG one.\u003c/li\u003e\n \u003cli\u003eSelect \u0026ldquo;no or accessory PTM\u0026rdquo; (u U \u003cu\u003eo\u003c/u\u003e).\u003c/li\u003e\n \u003cli\u003eAdjust disorder and confidence thresholds if necessary. Here, strict disorder is set on so that confidence can be very low (1).\u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003eThis selection highlights the presence of a phosphorylation site (S432) in the TRF1 binding motif (LIG_TRFH1) of NBN, although this modification is not required. Experimental data listed in PSP+ indicate that this modification is rather frequent and deleterious for the interaction between NBN and TRF2, a paralogue of TRF1. This illustrates another example of the utility of PSP+ selection tools for PTM analysis.\u003c/p\u003e\n\u003cp\u003e3.3.3 Interactome network viewing using Cytoscape\u003c/p\u003e\n\u003cp\u003eOnce a selection is achieved, it can be visualized in a dedicated Cytoscape window displaying the corresponding network. The progression of the analysis is illustrated in Figure 7.\u003c/p\u003e\n\u003cp\u003eBy default, all the proteins harboring a validated pairing (ex.: TNKS-FNBP1) are connected. Each protein is shown as a purple rectangle that contains a green hexagon representing a PFam domain. A set of different colors characterizes each type of link between two protein partners: ELM motif/PFam domain pair, BioGRID or IntAct connections as well as HSM scoring. This potentially emphasizes dense sub-networks that usually correspond to macromolecular assemblies (based often also on domain-domain interactions) and/or singletons (not shown here) that may require further inspection. The latter may belong to the studied interactome - through unknown interaction - or correspond to spurious preys. Proteins can be rearranged within this window to better show these networking features. This may provide clues to resume searching for ELM/PFam pairing by focusing on particular proteins. This analysis is complementary to the analysis summarised in the main table.\u003c/p\u003e\n\u003cp\u003eFor example, in Figure 7 several protein complexes, like STRIPAK (striatin-interacting phosphatase and kinase), MRN (MRE11, RAD50, NBN) that were initially disconnected or only lightly connected to the Tankyrases are linked (red arrows) at the end of the analysis. The addition of alternative TBMs appear to directly link to MRE11 as well as the MRN complex to tankyrase. Furthermore, apparently indirect binders like MCL1 and CHK2 are also directly linked to Tankyrases.\u003c/p\u003e\n\u003cp\u003e3.4 Structural model prediction of a given PPI\u003c/p\u003e\n\u003cp\u003eFinally, SLiMAn can check for the presence of related complexes in the PDB. It requires a folded domain matching the referenced PFam in the structure as well as a peptide (less than 35 residues) matching the desired ELM motif, in order to be considered a potential template for comparative modeling. For each class in ELM, an extraction from the PDB led to suitable templates for almost half of the possible ELM/PFam pairs, for a total of 5325 extracted templates.\u003c/p\u003e\n\u003cp\u003eSLiMAn gives first access to an interactive webpage (SLiM-ID) to handle paired sequence alignments for both the motif and the PFam domain. Then, comparative modeling can be submitted and the results can be visualized on a second webpage (SLiM-IM). Models can be downloaded for further study. They can also be tagged as \u003cem\u003evalidated\u003c/em\u003e or \u003cem\u003ediscarded\u003c/em\u003e to further assist the user in defining the interaction network in the main result page.\u003c/p\u003e\n\u003cp\u003e3.4.1. Sequence to structure alignments\u0026nbsp;\u003c/p\u003e\n\u003cp\u003eIf pre-extracted templates are available for comparative modeling of a given ELM-PFam pairing,\u0026nbsp;\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eaccess the sequence alignment interface, which is presented under the SLiM-ID environment (see Figure 8A).\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eIn SLiM-ID, first a summary of the paired ELM motif and the matched PFam domain sequences is highlighted on the top of the page. In addition, double alignments (of the motif and the domain sequences) with potential templates of complexes are performed using two different tools, MAFFT [43] and BLAST [44].\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eELM motif and PFam domain boundaries are directly extracted from their respective databases. If needed, manually edit them to re-compute alignments with re-defined sequence boundaries (Figure 8B). The ELM motif is generally well aligned to the corresponding peptide within the template thanks to the conserved ELM signature. The PFam domain may be aligned on much more divergent templates (\u0026lt; 35% of sequence identity). This might indicate that the overall fold is conserved but this might not include the binding site. In case of too low sequence similarity, cautiously discard the match. In this situation, use alternative approaches to predict the desired complex (e.g.: using HADDOCK [45] pepATTRACK[46] or AlphaFold [28]). Because SLiMAn requires a perfect match between the ELM motif regular expression to detect the peptide in a template, it may sometimes miss suitable templates. Again, alternative routes to modeling are necessary in that case.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eTo guide the selection of the most suitable templates, several alignment metrics are computed: sequence identity (%ident), query coverage (%QueryCoverage), template coverage (%TemplateCoverage) and a conserved contact score (CCS and %CCS). At any time, the alignment table can be sorted according to one of these metrics (see Note 6). In addition, to facilitate the visual inspection of the alignments, residues belonging to the peptide-protein interface are coloured (green, orange and red) according to the contact distances (of 4.0, 5.5 and 7.0 Angstroms respectively).\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eBefore launching the modelling process, select the desired entries to serve as templates for comparative modelling, according to two options:\u0026nbsp;\u003col class=\"decimal_type\" style=\"list-style-type: lower-alpha;\"\u003e\n \u003cli\u003ea custom selection by checking the box on the left of each alignment in the table,\u0026nbsp;\u003c/li\u003e\n \u003cli\u003eby using the automated selection tools, to select top 5, non-redundant PDB, or all available templates.\u0026nbsp;\u003cbr\u003eOnce, the alignments have been optimized, validated and at least one template is selected (Figure 8), click the \u0026ldquo;Launch modelling\u0026rdquo; button to start the comparative modelling process using SCWRL3.0 [47].\u003c/li\u003e\n \u003c/ol\u003e\n \u003c/li\u003e\n\u003c/ol\u003e\n\u003cp\u003e3.4.2. Structure Modeling\u003c/p\u003e\n\u003cp\u003eDuring the modeling process (approximately a few seconds per model) of the complex by SCWRL3.0, identical side-chains are kept fixed during the optimization first of the domain (in presence of the peptide from the original template), then of the peptide (in presence of the modeled domain).\u003c/p\u003e\n\u003col\u003e\n \u003cli\u003eThe completion of modelling, triggers a re-direction to the SLiM-IM environment. In the example, the 3Dmol.js viewer is used to display the complexes (Figure 9) [48]. In addition, an interaction analysis is performed by BINANA [49], highlighting favourable hydrophobic contacts (grey spheres) and hydrogen bonds (black arrows) as well as potential steric clashes (red spheres).\u003c/li\u003e\n \u003cli\u003eAt the bottom of the page is displayed a table containing the various information and intermediate models generated along the process. This table holds the original PDBid, its extracted SLiM-domain templates, model of the domain, model of the motif and the reconstituted complex. Click on the displayed structures to visualized or downloaded for local analysis.\u0026nbsp;\u003c/li\u003e\n \u003cli\u003e\u0026nbsp;\u0026ldquo;Validate\u0026rdquo; or \u0026ldquo;Discard\u0026rdquo; models in the last column of the Table, based on own expertise.\u003c/li\u003e\n \u003cli\u003eClick on the \u0026ldquo;Save selection\u0026rdquo; button, to erase discarded models and include validated models in the hit prediction table in SLiM-IP. The latter will be easily searchable using the \u0026ldquo;SLiMIM valid models\u0026rdquo; filter.\u003c/li\u003e\n\u003c/ol\u003e"},{"header":"4. Notes","content":"\u003cp\u003e1. Pfam database is now integrated in Interpro consortium (https://www.ebi.ac.uk/interpro/).\u003c/p\u003e\n\u003cp\u003e2. SLiMAn adjusts its selection filters to bring a list of pairing neither too huge nor too tiny (if possible). These parameters should be adapted to each particular query and vary during an actual survey(see 3.3).\u003c/p\u003e\n\u003cp\u003e3. In IntAct, directional information regarding the actual bait and prey is available in most experiments. For HuRI, multiple directional two-hybrid assays can be listed as well.\u003c/p\u003e\n\u003cp\u003e4. The GSK3b\u0026nbsp;docking site in AXIN1 corresponds to a folded helix with low disorder scores by IUPred and AlphaFold while the phosphorylation motif in TP53 has a high \u003cem\u003eE-value\u003c/em\u003e (0.027). These connections would not appear with standard parameters of SLIMan (E-value \u0026lt; 0.005 or IUPRED score \u0026gt; 0.4).\u003c/p\u003e\n\u003cp\u003e5. As \u0026ldquo;Create your own RegEx\u0026rdquo; option offer the possibility to deal with the stringency ot a given motif, Non-canonical TBMs, R.{5}G and R.{10}G, could be also evaluated.\u003c/p\u003e\n\u003cp\u003e6. We strongly advise users to mainly focus on both the query coverage and %CCS, as the first is representing the percentage of query amino acids that will be modelled while the latter corresponds to the percentage of conserved contacts between the motif and the domain in the future comparative model.\u0026nbsp;\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eStumpf MPH, Thorne T, Silva E de, et al (2008) Estimating the size of the human interactome. Proc Natl Acad Sci (U S A) 105:6959\u0026ndash;6964\u003c/li\u003e\n\u003cli\u003eBraakman I and Hebert DN (2013) Protein folding in the endoplasmic reticulum. Cold Spring Harb Perspect Biol 5:a013201\u003c/li\u003e\n\u003cli\u003eBozaykut P, Ozer NK, and Karademir B (2014) Regulation of protein turnover by heat shock proteins. Free Radic Biol Med 77:195\u0026ndash;209\u003c/li\u003e\n\u003cli\u003eTorres-Quesada O, Mayrhofer JE, and Stefan E (2017) The many faces of compartmentalized PKA signalosomes. Cell Signal 37:1\u0026ndash;11\u003c/li\u003e\n\u003cli\u003eHuttlin EL, Ting L, Bruckner RJ, et al (2015) The BioPlex Network: A Systematic Exploration of the Human Interactome. Cell 162:425\u0026ndash;440\u003c/li\u003e\n\u003cli\u003eHuttlin EL, Bruckner RJ, Paulo JA, et al (2017) Architecture of the human interactome defines protein communities and disease networks. Nature 545:505\u0026ndash;509\u003c/li\u003e\n\u003cli\u003eCafarelli TM, Desbuleux A, Wang Y, et al (2017) Mapping, modeling, and characterization of protein-protein interactions on a proteomic scale. Curr Opin Struct Biol 44:201\u0026ndash;210\u003c/li\u003e\n\u003cli\u003eRuwolt M, Piazza I, and Liu F (2023) The potential of cross-linking mass spectrometry in the development of protein-protein interaction modulators. Curr Opin Struct Biol 82:102648\u003c/li\u003e\n\u003cli\u003ePaiano A, Margiotta A, De Luca M, et al (2019) Yeast Two-Hybrid Assay to Identify Interacting Proteins. Curr Protoc Protein Sci 95:e70\u003c/li\u003e\n\u003cli\u003eLuck K, Kim D-K, Lambourne L, et al (2020) A reference map of the human binary protein interactome. Nature 580:402\u0026ndash;408\u003c/li\u003e\n\u003cli\u003eBenz C, Ali M, Krystkowiak I, et al (2022) Proteome-scale mapping of binding sites in the unstructured regions of the human proteome. Mol Syst Biol 18:e10584\u003c/li\u003e\n\u003cli\u003eBao Y, Pan Q, Xu P, et al. (2023) Unbiased interrogation of functional lysine residues in human proteome. Mol Cell. 83:4614-4632\u003c/li\u003e\n\u003cli\u003eYu C and Huang L (2023) New advances in cross-linking mass spectrometry toward structural systems biology. Curr Opin Chem Biol 76:102357\u003c/li\u003e\n\u003cli\u003eEdwards AM, Kus B, Jansen R, et al (2002) Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet 18:529\u0026ndash;536\u003c/li\u003e\n\u003cli\u003eJansen R, Yu H, Greenbaum D, et al (2003) A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data. 302:449\u0026ndash;453\u003c/li\u003e\n\u003cli\u003eMayer BJ (2015) The discovery of modular binding domains: building blocks of cell signalling. Nat Rev Mol Cell Biol 16:691\u0026ndash;698\u003c/li\u003e\n\u003cli\u003eKumar M, Michael S, Alvarado-Valverde J, et al (2022) The Eukaryotic Linear Motif resource: 2022 release. Nucleic Acids Research 50:D497\u0026ndash;D508\u003c/li\u003e\n\u003cli\u003eTompa P, Davey NE, Gibson TJ, et al (2014) A million peptide motifs for the molecular biologist. Mol Cell 55:161\u0026ndash;169\u003c/li\u003e\n\u003cli\u003eKitamura N and Galligan JJ (2023) A global view of the human post-translational modification landscape. Biochem J 480:1241\u0026ndash;1265\u003c/li\u003e\n\u003cli\u003eDavey NE, Trav\u0026eacute; G, and Gibson TJ (2011) How viruses hijack cell regulation. Trends Biochem Sci 36:159\u0026ndash;169\u003c/li\u003e\n\u003cli\u003eGemovic B, Sumonja N, Davidovic R, et al Mapping of Protein-Protein Interactions: Web-Based Resources for Revealing Interactomes. 26:3890\u0026ndash;3910\u003c/li\u003e\n\u003cli\u003eSzklarczyk D, Kirsch R, Koutrouli M, et al (2023) The STRING database in 2023: protein-protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Res 51:D638\u0026ndash;D646\u003c/li\u003e\n\u003cli\u003eMirela Bota P, Hernandez AC, Segura J, et al (2023) CM2D3: Furnishing the Human Interactome with Structural Models of Protein Complexes Derived by Comparative Modeling and Docking. J Mol Biol 435:168055\u003c/li\u003e\n\u003cli\u003eOughtred R, Rust J, Chang C, et al (2021) The BioGRID database: A comprehensive biomedical resource of curated protein, genetic, and chemical interactions. Protein Sci 30:187\u0026ndash;200\u003c/li\u003e\n\u003cli\u003eDel Toro N, Shrivastava A, Ragueneau E, et al (2022) The IntAct database: efficient access to fine-grained molecular interaction data. Nucleic Acids Res. 50:D648-D653\u003c/li\u003e\n\u003cli\u003eReys V and Labesse G (2022) SLiMAn: An Integrative Web Server for Exploring Short Linear Motif-Mediated Interactions in Interactomes. J Proteome Res 21:1654\u0026ndash;1663\u003c/li\u003e\n\u003cli\u003eZhou X, Hu J, Zhang C, et al (2019) Assembling multidomain protein structures through analogous global structural alignments. Proc Natl Acad Sci (U S A). 116:15930-15938\u003c/li\u003e\n\u003cli\u003eEvans R, O\u0026rsquo;Neill M, Pritzel A, et al (2022), Protein complex prediction with AlphaFold-Multimer, https://www.biorxiv.org/content/10.1101/2021.10.04.463034v2\u003c/li\u003e\n\u003cli\u003earadi M, Anyango S, Deshpande M, et al (2021) AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res 50:D439\u0026ndash;D444\u003c/li\u003e\n\u003cli\u003eLi X, Han H, Zhou M-T, et al (2017) Proteomic Analysis of the Human Tankyrase Protein Interaction Network Reveals Its Role in Pexophagy. Cell Reports 20:737\u0026ndash;749\u003c/li\u003e\n\u003cli\u003eHornbeck PV, Chabra I, Kornhauser JM, et al (2004) PhosphoSite: A bioinformatics resource dedicated to physiological protein phosphorylation. Proteomics 4:1551\u0026ndash;1561\u003c/li\u003e\n\u003cli\u003eShannon P, Markiel A, Ozier O, et al. (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13:2498-2504. \u003c/li\u003e\n\u003cli\u003ePratt H, Weng Z. LogoJS: a Javascript package for creating sequence logos and embedding them in web applications. Bioinformatics. 2020 Jun 1;36(11):3573-3575.\u003c/li\u003e\n\u003cli\u003eM\u0026eacute;sz\u0026aacute;ros B, Erdos G, and Doszt\u0026aacute;nyi Z (2018) IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res 46:W329\u0026ndash;W337\u003c/li\u003e\n\u003cli\u003eJumper J, Evans R, Pritzel A, et al (2021) Highly accurate protein structure prediction with AlphaFold. Nature 596:583\u0026ndash;589\u003c/li\u003e\n\u003cli\u003eCunningham JM, Koytiger G, Sorger PK, et al (2020) Biophysical prediction of protein-peptide interactions and signaling networks using machine learning. Nat Methods 17:175\u0026ndash;183\u003c/li\u003e\n\u003cli\u003eGuettler S, LaRose J, Petsalaki E, et al (2011) Structural basis and sequence rules for substrate recognition by Tankyrase explain the basis for cherubism disease. Cell 147:1340\u0026ndash;1354\u003c/li\u003e\n\u003cli\u003eKoirala S, Klein J, Zheng Y, et al (2020) Tissue-Specific Regulation of the Wnt/\u0026beta;-Catenin Pathway by PAGE4 Inhibition of Tankyrase. Cell Rep 32:107922\u003c/li\u003e\n\u003cli\u003eMorrone S, Cheng Z, Moon RT, et al (2012) Crystal structure of a Tankyrase-Axin complex and its implications for Axin turnover and Tankyrase substrate recruitment. Proc Natl Acad Sci (U S A). 109:1500-1505\u003c/li\u003e\n\u003cli\u003eDaRosa PA, Klevit RE, and Xu W (2018) Structural basis for tankyrase-RNF146 interaction reveals noncanonical tankyrase-binding motifs. Protein Sci 27:1057\u0026ndash;1067\u003c/li\u003e\n\u003cli\u003eAyyappan V, Wat R, Barber C, et al (2021) ADPriboDB 2.0: an updated database of ADP-ribosylated proteins. Nucleic Acids Research 49:D261\u0026ndash;D265\u003c/li\u003e\n\u003cli\u003eGogl G, Jane P, Caillet-Saguy C, et al.. (2020) Dual Specificity PDZ- and 14-3-3-Binding Motifs: A Structural and Interactomics Study. Structure. 28:747-759\u003c/li\u003e\n\u003cli\u003eKatoh K, Misawa K, Kuma K, et al (2002) MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res 30:3059\u0026ndash;3066\u003c/li\u003e\n\u003cli\u003eAltschul SF, Gish W, Miller W, et al (1990) Basic local alignment search tool. J Mol Biol 215:403\u0026ndash;410\u003c/li\u003e\n\u003cli\u003ede Vries SJ, van Dijk M, and Bonvin AMJJ (2010) The HADDOCK web server for data-driven biomolecular docking. Nat Protoc 5:883\u0026ndash;897\u003c/li\u003e\n\u003cli\u003ede Vries SJ, Rey J, Schindler CEM, et al (2017) The pepATTRACT web server for blind, large-scale peptide-protein docking. Nucleic Acids Res 45:W361\u0026ndash;W364\u003c/li\u003e\n\u003cli\u003eWang Q, Canutescu AA, and Dunbrack RL (2008) SCWRL and MolIDE: computer programs for side-chain conformation prediction and homology modeling. Nat Protoc 3:1832\u0026ndash;1847\u003c/li\u003e\n\u003cli\u003eRego N and Koes D (2015) 3Dmol.js: molecular visualization with WebGL. Bioinformatics 31:1322\u0026ndash;1324\u003c/li\u003e\n\u003cli\u003eYoung J, Garikipati N, and Durrant JD. (2022) BINANA 2: Characterizing Receptor/Ligand Interactions in Python and JavaScript. J Chem Inf Model. 62:753-760\u003c/li\u003e\n\u003c/ol\u003e"},{"header":"Tables","content":"\u003cp\u003eTable 1 is available in the Supplementary Files section.\u003c/p\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":true,"hideJournal":true,"highlight":"","institution":"Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ. Montpellier, Montpellier, France","isAcceptedByJournal":false,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":" Interactomes, proteome annotations, comparative modeling, protein sequence","lastPublishedDoi":"10.21203/rs.3.rs-3973092/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-3973092/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eInteractomics is bringing a deluge of data regarding protein-protein interactions (PPIs) which are involved in various molecular processes in all types of cells. However, this information does not easily translate into direct and precise molecular interfaces. This limits our understanding of each interaction network and prevents their efficient modulation. A lot of the detected interactions involve recognition of short linear motifs (SLiMs) by a folded domain while others rely on domain-domain interactions. Functional SLiMs hide among a lot of spurious ones, making deeper analysis of interactomes tedious. Hence, actual contacts and direct interactions are difficult to identify.\u003c/p\u003e\n\u003cp\u003eConsequently, there is a need for user-friendly bioinformatic tools, enabling \u0026nbsp;rapid molecular and structural analysis of SLiM-based PPIs in a protein network. In this chapter, we describe the use of the new webserver SLiMAn to help digging into SLiM-based PPIs in an interactive fashion.\u003c/p\u003e","manuscriptTitle":"Detection and analysis of Short Linear Motif based protein-protein interactions with SLiMAn2 web server","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-02-23 05:47:13","doi":"10.21203/rs.3.rs-3973092/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"5c01c6d1-6624-425e-8748-a7293646f363","owner":[],"postedDate":"February 23rd, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"posted","subjectAreas":[{"id":28869724,"name":"Structural Biology"},{"id":28869725,"name":"Bioinformatics"}],"tags":[],"updatedAt":"2024-02-23T05:47:13+00:00","versionOfRecord":[],"versionCreatedAt":"2024-02-23 05:47:13","video":"","vorDoi":"","vorDoiUrl":"","workflowStages":[]},"version":"v1","identity":"rs-3973092","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-3973092","identity":"rs-3973092","version":["v1"]},"buildId":"qtupq5eGEP_6zYnWcrvyt","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: preprint-html ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00