Venture Capital – Startup Network

Original Documents:
 Initial Report PDF
 Initial input data file: Excel file, CSV file
 Git repo: /home/ec2user/GitRepo/R/VentureCapital.git/ (ec2:/home/ec2user/GitRepo/R/VentureCapital.git, where ec2: is an alias)

Clustering
 For Firms FirmCluster.json
 For Companies

Command to import data:
 Change directory to mongo bin path.
 ./mongoimport -d ndataconsulting -c VCDeals --type csv --file /home/ntreees/VCdeals.csv --headerline
 The command above imports the CSV file into the VCDeals collection of the ndataconsulting database, using the first line of the file as field names.
 Single-column indexes (newer MongoDB releases use createIndex in place of ensureIndex):
 db.VCDeals.ensureIndex({"companyname":1})
 db.VCDeals.ensureIndex({"datefund":1})
 db.VCDeals.ensureIndex({"firmname":1})
 db.VCDeals.ensureIndex({"companysituation":1})
 db.VCDeals.ensureIndex({"companypublicstatus":1})
 db.VCDeals.ensureIndex({"companystatecode":1})

iGraph R Module
 http://igraph.wikidot.com/communitydetectioninr
 IGraph Community Detection Details http://www.rbloggers.com/summaryofcommunitydetectionalgorithmsinigraph06/
 iGraph Documentation
 iGraph Tutorial (http://igraph.sourceforge.net/igraphbook/igraphbookdatamodel.html)
 Drawing Graph (http://horicky.blogspot.com/2012/04/basicgraphanalyticsusingigraph.html)
 Pseudo Inverse
 Get Adjacency Graph (http://stackoverflow.com/questions/14849835/howtocalculateadjacencymatricesinr)
 layout=layout.fruchterman.reingold (force-based layout implementation)
 Drawing Graph
 Degree of Graph
 Laplacian of Graph
 Example to create graph from data http://igraph.sourceforge.net/igraphbook/igraphbookcreating.html
 Integration With R
 Interesting Snippets
 http://www.rbloggers.com/networkvisualizationinrwiththeigraphpackage/
 http://markov.uc3m.es/2012/11/temporalnetworkswithigraphandrwith20linesofcode/
 http://rdatamining.wordpress.com/2012/05/17/anexampleofsocialnetworkanalysiswithrusingpackageigraph/
 http://nsaunders.wordpress.com/2010/04/21/experimentswithigraph/
 http://stackoverflow.com/questions/9876267/rigraphcommunitydetectionedgebetweennessmethodcountlistmembersofe
 http://rulesofreason.wordpress.com/2012/11/05/networkvisualizationinrwiththeigraphpackage/
 http://somelab.net/2012/11/howtocreateanetworkanimationwithrandtheigraphpackage/

Details about the algorithm (Newman-Girvan cohesion-based clustering algorithm):
 This is an algorithm for finding community structure. In a network with community structure, the nodes within a group are densely connected to each other, while the groups themselves are only sparsely connected to one another. In other words, two nodes are more likely to be connected if they are in the same community and less likely if they are in different communities.
 The algorithm works by finding the edges that run between communities and removing them, leaving behind only the communities themselves. To identify those edges it uses edge betweenness.
 Betweenness assigns a large score to an edge if it lies on the shortest paths between many pairs of nodes.
 Popular but slow: it takes O(m^2 n) time on a network of n vertices and m edges, making it impractical for large networks.
 It focuses on the edges that are least central to communities, that is, the edges that are most “between” communities. The communities are detected by progressively removing these edges from the original graph.
 If a network contains communities or groups that are only loosely connected by a few intergroup edges, then all shortest paths between different communities must go along one of these few edges. Thus, at least one of the edges connecting communities will have high edge betweenness. By removing these edges, the groups are separated from one another and the underlying community structure of the network is revealed.
 http://www.sixhat.net/findingcommunitiesinnetworkswithrandigraph.html
 http://open.umich.edu/education/si/si508/fall2008/materials#Labs
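
 The edge-removal idea described above can be sketched in a few lines. This is a minimal illustration in plain Python, not the igraph implementation used for the project. The function names (edge_betweenness, components, girvan_newman_split) and the toy graph are hypothetical, and the sketch assumes an unweighted, undirected graph with no self-loops. It stops at the first split instead of building the full hierarchy.

```python
from collections import deque


def edge_betweenness(adj):
    """Edge betweenness for an unweighted, undirected graph.

    adj maps each node to the set of its neighbours.
    Returns {frozenset({u, v}): score}.
    """
    score = {frozenset((u, v)): 0.0 for u in adj for v in adj[u]}
    for source in adj:
        # BFS from source, counting the shortest paths (sigma) to each node.
        dist = {source: 0}
        sigma = {source: 1.0}
        preds = {source: []}
        order = []
        queue = deque([source])
        while queue:
            u = queue.popleft()
            order.append(u)
            for v in adj[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    sigma[v] = 0.0
                    preds[v] = []
                    queue.append(v)
                if dist[v] == dist[u] + 1:
                    sigma[v] += sigma[u]
                    preds[v].append(u)
        # Walk back from the farthest nodes, sharing path credit over edges.
        delta = {v: 0.0 for v in order}
        for v in reversed(order):
            for u in preds[v]:
                credit = sigma[u] / sigma[v] * (1.0 + delta[v])
                score[frozenset((u, v))] += credit
                delta[u] += credit
    # Every node pair was counted once from each endpoint, so halve.
    return {edge: s / 2.0 for edge, s in score.items()}


def components(adj):
    """Return the connected components of the graph as a list of node sets."""
    seen, parts = set(), []
    for start in adj:
        if start in seen:
            continue
        part, stack = set(), [start]
        while stack:
            u = stack.pop()
            if u in part:
                continue
            part.add(u)
            stack.extend(adj[u] - part)
        seen |= part
        parts.append(part)
    return parts


def girvan_newman_split(edges):
    """Remove highest-betweenness edges until the graph gains a component."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    start = len(components(adj))
    removed = []
    while len(components(adj)) == start and any(adj.values()):
        scores = edge_betweenness(adj)  # recomputed after every removal
        top = max(scores, key=scores.get)
        u, v = tuple(top)
        adj[u].discard(v)
        adj[v].discard(u)
        removed.append(top)
    return components(adj), removed


# Toy graph: two triangles joined by the single bridge c-d.
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"),
             ("c", "d"),
             ("d", "e"), ("e", "f"), ("d", "f")]
parts, removed = girvan_newman_split(toy_edges)
print(sorted(sorted(p) for p in parts))  # [['a', 'b', 'c'], ['d', 'e', 'f']]
```

 In the toy graph every shortest path between the two triangles crosses the bridge c-d, so that edge gets the highest betweenness and is removed first. Recomputing the scores after each removal is what makes the method slow on large networks.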