A network motif is defined as a statistically significant and recurring subgraph pattern within a network. Most existing instance collection methods are not feasible due to high memory usage issues and provision of limited network motif information.
Trang 1The Author(s) BMC Bioinformatics 2017, 18(Suppl 12):423
DOI 10.1186/s12859-017-1822-6
NemoProfile as an efficient approach to
network motif analysis with instance collection
From 12th International Symposium on Bioinformatics Research and Applications (ISBRA 2016)
Minsk, Belarus 5-8 June 2016
Abstract
Background: A network motif is defined as a statistically significant and recurring subgraph pattern within a
network Most existing instance collection methods are not feasible due to high memory usage issues and provision
of limited network motif information They require a two-step process that requires network motif identification prior
to instance collection Due to the impracticality in obtaining motif instances, the significance of their contribution to problem solving is debated within the field of biology
Results: This paper presents NemoProfile, an efficient new network motif data model NemoProfile simplifies
instance collection by resolving memory overhead issues and is seamlessly generated, thus eliminating the need for costly two-step processing Additionally, a case study was conducted to demonstrate the application of network motifs to existing problems in the field of biology
Conclusion: NemoProfile comprises network motifs and their instances, thereby facilitating network motifs usage in
real biological problems
Keywords: NemoProfile, NemoCollect, ESU, Systems biology, Biological network, Network motif, Essential protein
Background
Systems biology elucidates, models, and predicts the
behavior of all biological components and their
interac-tions Its emphasis on the interconnections of molecules
produced biological networks as described in Fig 1, where
nodes are molecules and edges are interactions between
them Understandably, various graph theory topics are
substantially applied to resolve various biological
prob-lems, such as prediction of biological function, detection
of protein complexes, discovery of new interactions,
evo-lutionary analysis, information integration, diagnosis of
disease, and drug design [1]
Network motif analysis is one of the graph theory
methods used to find biologically relevant functions in
networks [2] A network motif is defined as an overly
fre-quent and unique subgraph pattern in a network, and it
*Correspondence: kimw6@uw.edu
Division of Computing and Software Systems, School of Science, Technology,
Engineering, and Mathematics (STEM), University of Washington Bothell,
18115 Campus Way NE, 98011-8246 Bothell, WA, USA
has been applied to solve various biological and medi-cal problems: predicting protein-protein interactions [3], determining protein functions [4], detecting breast-cancer susceptibility genes [5], investigating for evolutionary con-servation [6, 7], and discovering essential proteins [8, 9] Furthermore, a broad spectrum of applications has been explored: ‘motif clustering’ [10], ‘motif themes’ [11], ‘rel-ative graphlet frequency distances’ [12, 13], ‘motif modes’ [14], and ‘MotifScores’ [15]
However, identifying network motifs is intrinsically very costly, and this high computational cost restricts extensive and exhaustive experiments for real problems The pro-cess involves enumeration of millions of subgraphs in the input graph, and classification through canonical labeling
or isomorphic testing Then, a network motif ’s unique-ness is established through rigorous statistical testing
in a huge random pool Consequently, various heuristic methods and parallel algorithms have been proposed that alleviate the performance concerns of exhaustive search methods [16]
© The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0
International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made The Creative Commons Public Domain Dedication waiver
Trang 2Fig 1 Examples of biological networks: a a metabolic network is composed of different types of nodes and edges; b all the nodes in a gene regulatory network are genes, and directed edges represent a regulatory process; c a protein-protein interaction network is composed of proteins,
and their binary interactions are undirected edges
Network motifs may remain meaningless unless their
biological significance is properly evaluated In order to
determine biological relevance, individual motif instances
need to be collected and evaluated in the context of
bio-logical systems However, most motif-finding algorithms
provide only frequency and statistical significance of each
pattern, which restricts its usability for real-world
prob-lems Therefore, we introduce a new network motif
rep-resentation to overcome this problem, and define it as
NemoProfile
In this paper, we show how efficiently NemoProfile
is generated and how this significantly reduces motif
instance collection time We also provide a case study
where NemoProfile is directly applied to the prediction of
essential proteins from protein-protein interaction (PPI)
networks
Methods
Here, we introduce a new network motif representation,
as NemoProfile NemoProfile can be effortlessly
gen-erated while detecting network motifs, and effectively
collects network motif instances We designed and
imple-mented a program based on a flowchart illustrated in
Fig 2 to provide three separate output options:
NemoPro-file, NemoCount, and NemoCollect
NemoCount, which implements ESU (Enumerate
SUb-graphs) algorithm [17], provides the frequency and
statis-tical testing result only NemoProfile and NemoCollect are
described followed by the definition of network motifs
Network motif
Network motifs are defined as frequent and unique
sub-graphs in a network Formally, if G = (V, E) is a graph and
k ranges from 3 to n << |V|, then a network motif m is
a connected subgraph of size k in G, which appears more
frequently than usual In the definition of network motifs,
‘more frequent than usual’ refers to a structural
unique-ness and it is determined by p-value as in Eq (1) or z-score
as in Eq (2) after a number of random graphs have been generated
p-value(m) = 1
N
N
n=1
c(n),wherec(n) =
1, if f R (m) ≥ f G (m)
0, otherwise.
(1)
z-score(m) = f G (m) − average(f R (m))
Here, f G (m) is the frequency of motif m in G and f R (m)
be that of motif in random graph R Also, average (f R (m)) and std (f R (m)) refer to the average and standard deviation
of frequencies in random networks, respectively
Gener-ally, a subgraph with p-value less than 0.01 or z-score
greater than 2.0 is considered as a network motif
Figure 3 describes how to find size 3 network motifs
from the input graph G in the upper left corner by ESU
algorithm [17] The method enumerated a total of 16 sub-graphs of size 3, but one instance, ({1,2,3}), is a triangle type while others are all linear types Although the fre-quency of triangle type is much less than the linear type,
p-value and z-score determine that the triangle type is a network motif Therefore the frequency (or count) of the network motif is 1, and the instance of the network motif
is ({1,2,3}) We want to note that all existing software
pro-grams provide the frequency, and p-value or z-scores of
network motifs but not the instances of network motifs due to heavy memory overhead In this paper, we put more
Trang 3The Author(s) BMC Bioinformatics 2017, 18(Suppl 12):423 Page 39 of 131
Fig 2 Flow chart of a network motif finding program producing NemoProfile, NemoCollect, and NemoCount(ESU) It searches all subgraphs in a
given network by ‘Enumeration,’ then asks if instances should be collected If yes, all the instances are collected as ‘SubgraphProfile’ form Otherwise the occurrences of each graph pattern is recorded as in the ‘Counting’ step These counts are essential to determine which graph pattern is a network motif as its relative frequency is compared in the random graph pool In the final output, the SubgraphProfile is used to produce
NemoProfile or NemoCollect NemoCount output can be produced without the SubgraphProfile
weight on the importance of network motif instances by
introducing a NemoProfile
Network motif detection algorithms
Various network-motif-finding algorithms are available,
classified into network-centric and motif-centric
algo-rithms [16] Network-centric algoalgo-rithms identify network
motifs while exploring subgraphs in the input graph,
whereas motif-centric algorithms count the instances for each pattern in a predefined query set Then its signif-icance is determined through various statistical testing
in a large random pool to determine network motifs, as summarized in Fig 4
Although network-centric algorithms have the bene-fit that subgraphs that are not in the input graph will never be considered, the inevitable enumeration process
Fig 3 An example of a graph with network motifs and their instances, courtesy of paper [17]
Trang 4Fig 4 Network-centric methods consists of enumeration and classification, random graph generation, and statistical testing For example, out of six
non-isomorphic subgraphs of size 4 (upper-right table), three patterns at the bottom are determined as network motifs
is heavily expensive Motif-centric algorithms can reduce
classification time if combined with symmetry breaking
or mapping strategies, and can directly verify whether a
specific pattern is a network motif or not [16, 17]
How-ever, the number of non-isomorphic subgraphs (patterns)
increases exponentially with the size of motifs, therefore
even listing them is intractable As an example, there are
11,716,571 patterns for motif size 10, as shown in Table 1
Many motif search programs are also available [18]:
MFinder [19], FANMOD [20], Kavosh [21], Mavisto [22],
Table 1 Number of non-isomorphic subgraphs for undirected
and directed graphs with up to 10 vertices [31]
Vertices Number of non-isomorphic subgraphs
Undirected Directed
10 11,716,571 341,247,400,399,400,000,000
and NeMoFinder [23] follow the network-centric meth-ods Motif-centric methods are available with Grochow’s [24], and MODA [25] However, most of them provide only frequency and statistical significance as in Fig 5, because collecting all instances of each pattern creates
a serious memory overhead problem Hypothetically, the number of subgraphs in the input graph is|E G||Em|where
|E G | is the number of edges in the input graph and |E m|
is the number of edges in motif m [26] That means most
biological networks have several tens or even hundreds
of millions of subgraphs, even for small motifs There-fore, instances of network motifs have to be collected
as post-processing if necessary, and it usually requires more efforts than detecting network motifs, as this step is unavailable with current programs
Considering that most real world problems that use network motifs require a knowledge of what nodes and edges actually belong to network motifs [8, 9], providing their instances will greatly increase the usability of net-work motifs Therefore, the net-work in this paper focuses
on the neglected task in network motif finding, which is collecting instances efficiently and utilizing them for real biological problems
NemoProfile
To reduce computational cost but still provide valuable results, we propose a new network motif representation,
Trang 5The Author(s) BMC Bioinformatics 2017, 18(Suppl 12):423 Page 41 of 131
Fig 5 The example format of network motif finding outputs, which shows frequency and significance for each pattern
NemoProfile that relates each node to network motifs
as a profile matrix while identifying network motifs As
illustrated in Fig 2, a SubgraphProfile, T as an n × m
matrix is first constructed where n is the number of nodes
in the input graph and m is the number of all subgraph
patterns of size k While enumerating, T ij increments by
1 if a pattern m j includes node v i After network motifs
are determined NemoProfile takes the network motif
columns from T.
For example, we can find 14 instances of “graph78”
size-3-subgraphs and 2 of “graph238” size-3-subgraphs if
a target graph has 9 vertices and 10 edges as shown in
Fig 6a While the Fanmod program that implements ESU
can trace all 16 instances, saving all instances as sets of
vertices, such as, graph78 = ({1, 2, 4},{1, 2, 5}, {1, 2,
6}, ,{2, 3, 9} ), graph238 =({1, 2, 3}, {3, 8, 9}) causes
a great amount memory overhead Therefore, Fanmod (ESU algorithm) provides the frequencies and statistical results of each type, but discards the motif instances On the other hand, NemoProfile saves the set of instances as
a matrix so that the frequency of each node’s involvement
to each pattern is recorded as shown in Fig 6a Figure 6b describes how to recover the network motif instances
as sets of vertices from a NemoProfile, which generates NemoCollect
NemoCollect
We define NemoCollect as in Algorithm 1 describing the process to collect instances of a network motif m with
NemoProfile It derives an induced subnetwork comprised
Fig 6 The process of collecting instances for graph238: a SubgraphProfile is obtained in the process of ESU from the original input graph in the left From the graph patterns graph78 and graph238, “graph238” is determined to be a network motif through statistical analysis; b NemoProfile (left) is derived from the SubgraphProfile in (a), and effectively identifies an induced subgraph for motif “graph238” by collecting the nodes corresponding
to a non-zero value in the column of graph238 The induced subgraph with nodes 1, 2, 3, 8, and 9 will be processed with EnumerateSubgraph(ESU)
[17] to collect all the instances for graph238
Trang 6Fig 7 a The chart shows that the running time (Y-axis) to detect size 4 network motifs from various size (number of nodes, X-axis) of PPI networks is very similar with all three options; b This chart compares times of ESU, NemoProfile, and NemoCollect methods while detecting various sizes of
motifs, from size 3 to size 8 NemoCollect takes longer than the other options for larger motif sizes
by all nodes whose m-corresponding column value being
non-negative The subnetwork is fed back to
Enumerate-Subgraph function from [17] to collect the instances of
motif m Figure 6 illustrates the process with an input
graph and NemoProfile
Algorithm 1:NEMOCOLLECT USINGNEMOPRO
-FILE ANDESU [17]
input : Graph G = (V, E), k : motif size,
NemoProfile, motif m
output
:
A set of instances of motif m
1 Let nodeIdVec as a set of the node IDs for
∀node ∈ G do
2 ifnode corresponds to motif m then
3 add to nodeIdvec;
4 Induce a subnetwork S of nodeIdvec
5 Call EnumerateSubgraph (S, k)
6 Output a set of instances of motif m
Results and discussion
We tested the efficiency and effectiveness of NemoPro-file with a number of PPI networks that are available in the DIP database [27] in a Linux operating system, Xeon Core i7 with 5,959 MiB system memory The DIP database includes eight different species of protein-protein inter-action (PPI) networks, which are manually and computa-tionally curated Almost every three months, the networks are updated by adding or removing proteins and their interactions We selected five E coli core PPIs, five S cere-visiae core PPIs, and six H sapiens core PPI networks that are updated each year from 2010 through 2014
We designed and implemented a program by modify-ing ESU with SNAP (Stanford Network Analysis Platform) C++ library [28] to have three separate output options,
‘NemoCount (ESU)’, ‘NemoProfile’, and ‘NemoCollect’ as shown in the bottom box in the flowchart of Fig 2 We should note that the ‘SubgraphProfile’ is an intermedi-ate datum to generintermedi-ate ‘NemoProfile’ and ‘NemoCollect’ at the end
Fig 8 a This chart shows that NemoCollect takes significantly less time compared to AllCollect and QueryCollect, while collecting size 4 motif instances from various inputs; b It shows that AllCollect takes up significant time compared to other methods, and that NemoCollect method is the
most efficient when collecting size 5 motif instances
Trang 7The Author(s) BMC Bioinformatics 2017, 18(Suppl 12):423 Page 43 of 131
Table 2 Running time (in seconds) of NemoCollect, QueryCollect
and AllCollect with various motif sizes, as in Fig 8b
E:size3 S:size3 E:size4 S:size4 E:size5
NemoCollect 5.56 27.86 61.74 348.01 1768.40
QueryCollect 11.18 54.14 117.45 655.04 2504.40
AllCollect 17.74 159.53 2405.90 131.49 170,420.00
NemoCollect is the most efficient, while AllCollect becomes intractable with motifs
of larger sizes
Performances of NemoProfile and NemoCollect are
compared with NemoCount in various testing scenarios
by varying the size of the input graph, or by varying the
size of network motifs to detect Figure 7a demonstrates
NemoProfile and NemoCollect take almost the same time
as NemoCount(ESU) for size 4 network motifs in
vari-ous input graph sizes Time for detection of varivari-ous sizes
of network motifs is also compared in Fig 7b Inevitably,
NemoCollect takes slightly more time than others as the
size increases due to the additional instance collection
time However, time of NemoProfile is still similar to that
of ESU proving that it is efficiently generated but contains
much richer information than ESU
Next, we wanted to see if NemoProfile significantly
alleviates the memory overhead problem when
collect-ing network motif instances We design “NemoCollect”
process as shown in Algorithm 1 which uses
NemoPro-file in the process Since none of the existing network
motif finding algorithms collect network motif instances,
we designed a couple of alternatives to compare them with
NemoCollect: AllCollect is collecting all subgraphs while
searching network motifs, and QueryCollect is
collect-ing the instances of motifs uscollect-ing motif-centric method
Although the time of AllCollect is directly measured, the
time for QueryCollect method is estimated, assuming that
it will run ESU first to determine network motifs and
run MODA later to collect the instances of the network
motifs Since MODA takes as much time as ESU,
accord-ing to paper [25, 26], we estimated the time for
QueryCol-lect as twice that of ESU Figure 8a and b demonstrate
that NemoCollect is the most efficient method for motif
instance collection, even with an increase in motif size
Table 2 supplements Fig 8b to show the differences
clearly
Case study: essential protein prediction and NemoProfile
This section demonstrates the usability of NemoPro-file for real-world applications, specifically predicting essential proteins in a PPI network where network motif analysis has been applied previously [8, 9]
We used E coli (‘Ecoli20101010CR’) and S cere-visiae (‘Scere20101010CR’) PPI networks from DIP, and
obtained the list of essential proteins from Database of essential genes (DEG)[29] E coli has 121 essential pro-teins out of 1,231 nodes, and S cerevisiae contains 782 essential proteins out of 2,200 proteins
First, NemoProfile program provides the NemoProfile
matrix (A) of each network where the number in A ijrefers
the number of protein i overlaps with a motif j Here, five
network motifs are identified in both of the networks, and NemoProfile structure is directly converted to the set of attributes for each protein The data attributes along with the protein’s essentiality is fed into Weka program [30]
to run a decision tree (J48) algorithm to predict essential proteins
Figure 9 summarizes the overall process, from a PPI net-work, through NemoProfile, and the application of the decision tree technique to predict essential proteins of
an organism The classification is evaluated using 10-fold cross-validation scheme, and Fig 10 is one example of Weka results on S cerevisiae PPI
Conclusions
Several computationally costly tasks are required for net-work motif finding since netnet-work motifs are unique both structurally and statistically These tasks include enu-meration, classification, and statistical analysis Network-centric and motif-Network-centric methods exist for finding motifs While these methods have reduced computational costs, they have not overcome the prejudice towards network motifs in problem solving The doubtfulness as to the rele-vance of network motifs in biological problems continues due to the lack of usability with existing programs Therefore, we emphasized their usability by present-ing NemoProfile, an efficient network motif represen-tation Significant improvement is seen with the mem-ory overhead problem resolution and the reuse of NemoProfile to collect instances of motifs for direct appli-cation to existing problems Additionally, NemoProfile provides the output from other representations, including
Fig 9 Process to predict essential proteins in a PPI network A PPI network is processed to obtain NemoProfile matrix (A) that shows protein i has A ij
overlaps with motif j The matrix is directly loaded into a Weka program to run a decision tree algorithm and is evaluated through 10-fold
cross-validation method to provide prediction rate and ROC area value as a result
Trang 8Fig 10 Result of prediction of essential proteins with Weka, detected from S cerevisiae
the frequencies and statistical significance of subgraph
patterns
A NemoProfile program was constructed and used
to demonstrate the effectiveness of network motifs in
application to real world problems The experiment was
conducted using PPI networks and the results showed
that NemoProfile succinctly represents network motifs
and their instances with no extra computational costs
incurred With a favorable outcome in comparison with
other alternative methods NemoCollect is defined as the
process of collecting instances from NemoProfile The
outcome clearly demonstrates that the performance is
sig-nificantly better than the alternatives A usability focused
case-study of NemoProfile was performed to predict
essential proteins in PPI networks According to the study,
the application of machine learning algorithms can be
eas-ily applied to NemoProfile by first converting it to data
feature space
Future works on NemoProfile include three main tasks
First, the design of a framework to enhance the
applica-tion of NemoProfile to current and future problems, thus
reducing prejudice towards network motif analysis in the
field of biology Second, enhance the NemoCollect process
using parallelization by leveraging each separate column
in NemoProfile Third, improve the NemoCollect process
using a symmetry breaking or mapping process
Abbreviations
DEG: Database of essential genes; DIP: Database of interacting proteins; ESU:
Enumerate SUbgraphs; NemoCollect: Network motif instance collection;
NemoCount: Network motif count; NemoProfile: Network motif profile; PPI:
Protein-protein interaction; SNAP: Stanford network analysis platform
Acknowledgements
The authors would like to thank the University of Washington Bothell for
supporting this research program.
Funding
Publication of this article was funded by the school of STEM at the University
of Washington at Bothell (UWB), which played a role in the design of the study
by providing an interdisciplinary research environment.
Availability of data and materials
The NemoProfile source and documentation is available at http://faculty washington.edu/kimw6/research.htm The datasets during the current study are available in the DIP (the Database of Interacting Proteins) [27] repository.
About this supplement
This article has been published as part of BMC Bioinformatics Volume 18
Supplement 12, 2017: Selected articles from the 12th International Symposium
on Bioinformatics Research and Applications (ISBRA-16): bioinformatics The full contents of the supplement are available online at https://bmcbioinformatics biomedcentral.com/articles/supplements/volume-18-supplement-12.
Authors’ contributions
WK conceived of the study, designed and tested the methods, and wrote the manuscript LH conceived of the study, implemented the methods, and wrote the manuscript All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Published: 16 October 2017
References
1 Callebaut W Scientific perspectivism: a philosopher of science’s response
to the challenge of big data biology Stud Hist Philos Biol Biomed Sci 2012;43(1):69–80.
2 Junker BH, Schreiber F Analysis of Biological Networks Wiley; 2008.
Trang 9The Author(s) BMC Bioinformatics 2017, 18(Suppl 12):423 Page 45 of 131
3 Albert I, Albert R Conserved network motifs allow protein-protein
interaction prediction Bioinformatics 2004;20(18):3346–52.
4 Chen J, Hsu W, Lee ML, Ng SK Labeling network motifs in protein
interactomes for protein function prediction Data Eng Int Conf.
2007;0:546–55.
5 Zhang Y, Xuan J, de los Reyes BG, Clarke R, Ressom HW Network
motif-based identification of breast cancer susceptibility genes In: 2008
30th Annual International Conference of the IEEE Engineering in
Medicine and Biology Society Vancouver; 2008 p 5696–9.
6 Conant GC, Wagner A Convergent evolution of gene circuits Nat Genet.
2003;34(3):264–6.
7 Wuchty S, Oltvai ZN, Barabasi AL Evolutionary conservation of motif
constituents in the yeast protein interaction network Nat Genet.
2003;35(2):176–9.
8 Kim W Prediction of essential proteins using topological properties in
go-pruned ppi network based on machine learning methods Tsinghua
Sci Technol 2012;17(6):645–58.
9 Kim W, Li M, Wang J, Pan Y Essential protein discovery based on
network motif and gene ontology In: Proceedings of IEEE Bioinformatics
and Biomedicine Atlanta; 2011 p 470–5.
10 Dobrin R, Beg QK, Barabasi AL, Oltvai ZN Aggregation of topological
motifs in the escherichia coli transcriptional regulatory network BMC
Bioinformatics 2004;5:10.
11 Zhang L, King O, Wong S, Goldberg D, Tong A, Lesage G, Andrews B,
Bussey H, Boone C, Roth F Motifs, themes and thematic maps of an
integrated saccharomyces cerevisiae interaction network J Biol.
2005;4(2):6.
12 Milo R, Itzkovitz S, Kashtan N, Levitt R, Shen-Orr S, Ayzenshtat I,
Sheffer M, Alon U Superfamilies of evolved and designed networks.
Science 2004;303(5663):1538–42.
13 Przulj N, Corneil DG, Jurisica I Modeling interactome: scale-free or
geometric? Bioinformatics 2004;20(18):3508–15.
14 Lee WP, Jeng BC, Pai TW, Tsai CP, Yu CY, Tzou WS Differential
evolutionary conservation of motif modes in the yeast protein interaction
network BMC Genomics 2006;7(1):89.
15 Xie ZR, Hwang MJ An interaction-motif-based scoring function for
protein-ligand docking BMC Bioinformatics 2010;11(1):298.
16 Kim W, Diko M, Rawson K Network motif detection: Algorithms, parallel
and cloud computing, and related tools Tsinghua Sci Technol.
2013;18(5):469–89.
17 Wernicke S Efficient detection of network motifs IEEE/ACM Trans
Comput Biol Bioinform 2006;3(4):347–59.
18 Ribeiro P, Silva F, Kaiser M Strategies for network motifs discovery.
In: e-Science, 2009 e-Science ’09 Fifth IEEE International Conference On.
Oxford; 2009 p 80–7.
19 Kashtan N, Itzkovitz S, Milo R, Alon U Efficient sampling algorithm for
estimating sub-graph concentrations and detecting network motifs.
Bioinformatics 2004;20:1746–58.
20 Wernicke S, Rasche F Fanmod: a tool for fast network motif detection.
Bioinformatics 2006;22:1152–3.
21 Kashani Z, Ahrabian H, Elahi E, Nowzari-Dalini A, Ansari E, Asadi S,
Mohammadi S, Schreiber F, Masoudi-Nejad A Kavosh: a new algorithm
for finding network motifs BMC Bioinformatics 2009;10(1):318.
22 Schreiber F, Schwobbermeyer H Mavisto: a tool for the exploration of
network motifs Bioinformatics 2005;21:3572–4.
23 Chen J, Hsu W, Lee M, Ng S Nemofinder: Dissecting genome-wide
protein-protein interactions with meso-scale network motifs In:
Proceedings of the 12th ACM SIGKDD international conference on
Knowledge discovery and data mining New York: ACM SIGKDD; 2006.
p 106–15.
24 Grochow JA, Kellis M Network motif discovery using subgraph
enumeration and symmetry-breaking In: Proceedings of the 11th Annual
International Conference on Research in Computational Molecular
Biology Berlin; 2007 p 92–106 RECOMB’07.
25 Omidi S, Schreiber F, Masoudi-Nejad A Moda: An efficient algorithm for
network motif discovery in biological networks Genes Genet Syst.
2009;84(5):385–95.
26 Wong E, Baur B, Quader S, Huang CH Biological network motif
detection: principles and practice Brief Bioinform 2012;13(2):202–15.
27 Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, Eisenberg D Dip,
the database of interacting proteins: a research tool for studying cellular
networks of protein interactions Nucleic Acids Res 2002;30(1): 303–5.
28 Leskovec J, Sosiˇc R SNAP: A General Purpose Network Analysis and Graph Mining Library ACM Trans Intell Syst Technol 2016;8(1):1 doi:10.1145/2898361.
29 Zhang R, Ou HY, Zhang CT Deg: a database of essential genes Nucleic Acids Res 2004;32(suppl 1):271–2.
30 Frank E, Hall M, Trigg L, Holmes G, Witten IH Data mining in bioinformatics using weka Bioinformatics 2004;20(15):2479–81.
31 Harary F, Palmer EM Graphical Enumeration: Academic Press; 1973.
pp 2479–81 ISBN:0-12-324245-2.
• We accept pre-submission inquiries
• Our selector tool helps you to find the most relevant journal
• We provide round the clock customer support
• Convenient online submission
• Thorough peer review
• Inclusion in PubMed and all major indexing services
• Maximum visibility for your research Submit your manuscript at
www.biomedcentral.com/submit Submit your next manuscript to BioMed Central and we will help you at every step: