![]() |
STAG Python
1.2.1
Spectral Toolkit of Algorithms for Graphs
|
Functions | |
List[int] | spectral_cluster (graph.Graph g, int k) |
Spectral clustering algorithm. | |
List[int] | local_cluster (graph.LocalGraph g, int seed_vertex, float target_volume) |
Local clustering algorithm based on personalised Pagerank. | |
List[int] | local_cluster_acl (graph.LocalGraph g, int seed_vertex, float locality, float error=0.001) |
The ACL local clustering algorithm. | |
Tuple[scipy.sparse.csc_matrix, scipy.sparse.csc_matrix] | approximate_pagerank (graph.LocalGraph g, scipy.sparse.csc_matrix seed_vector, float alpha, float epsilon) |
Compute the approximate pagerank vector. | |
List[int] | sweep_set_conductance (graph.LocalGraph g, scipy.sparse.csc_matrix v) |
Find the sweep set of the given vector with the minimum conductance. | |
float | adjusted_rand_index (List[int] gt_labels, List[int] labels) |
Compute the Adjusted Rand Index between two label vectors. | |
float | conductance (graph.LocalGraph g, List[int] cluster) |
Compute the conductance of the given cluster in a graph. | |
List[int] stag.cluster.spectral_cluster | ( | graph.Graph | g, |
int | k | ||
) |
Spectral clustering algorithm.
This is a simple graph clustering method, which provides a clustering of the entire graph. To use spectral clustering, simply pass a stag.graph.Graph
object and the number of clusters you would like to find.
The spectral clustering algorithm has the following steps.
g | the graph object to be clustered |
k | the number of clusters to find. Should be less than \(n/2\). |
List[int] stag.cluster.local_cluster | ( | graph.LocalGraph | g, |
int | seed_vertex, | ||
float | target_volume | ||
) |
Local clustering algorithm based on personalised Pagerank.
Given a graph and starting vertex, return a cluster which is close to the starting vertex.
This method uses the ACL local clustering algorithm.
g | a graph object implementing the LocalGraph interface |
seed_vertex | the starting vertex in the graph |
target_volume | the approximate volume of the cluster you would like to find |
List[int] stag.cluster.local_cluster_acl | ( | graph.LocalGraph | g, |
int | seed_vertex, | ||
float | locality, | ||
float | error = 0.001 |
||
) |
The ACL local clustering algorithm.
Given a graph and starting vertex, returns a cluster close to the starting vertex, constructed in a local way.
The locality parameter is passed as the alpha parameter in the personalised pagerank calculation.
g | a graph object implementing the LocalGraph interface |
seed_vertex | the starting vertex in the graph |
locality | a value in \([0, 1]\) indicating how 'local' the cluster should be. A value of \(1\) will return the return only the seed vertex and a value of \(0\) will explore the whole graph. |
error | (optional) - the acceptable error in the calculation of the approximate pagerank. Default \(0.001\). |
Tuple[scipy.sparse.csc_matrix, scipy.sparse.csc_matrix] stag.cluster.approximate_pagerank | ( | graph.LocalGraph | g, |
scipy.sparse.csc_matrix | seed_vector, | ||
float | alpha, | ||
float | epsilon | ||
) |
Compute the approximate pagerank vector.
The parameters s, alpha, and epsilon are used as described in the ACL paper.
Note that the dimension of the returned vectors may not match the true number of vertices in the graph provided since the approximate pagerank is computed locally.
g | a stag.graph.LocalGraph object |
seed_vector | the seed vector of the personalised pagerank |
alpha | the locality parameter of the personalised pagerank |
epsilon | the error parameter of the personalised pagerank |
By the definition of approximate pagerank, it is the case that p + ppr(r, alpha) = ppr(s, alpha).
argument_error | if the provided seed_vector is not a column vector. |
List[int] stag.cluster.sweep_set_conductance | ( | graph.LocalGraph | g, |
scipy.sparse.csc_matrix | v | ||
) |
Find the sweep set of the given vector with the minimum conductance.
First, sort the vector such that \(v_1, \ldots, v_n\). Then let
\[ S_i = \{v_j : j <= i\} \]
and return the set of original indices corresponding to
\[ \mathrm{argmin}_i \phi(S_i) \]
where \(\phi(S)\) is the conductance of \(S\).
This method is expected to be run on vectors whose support is much less than the total size of the graph. If the total volume of the support of vec is larger than half of the volume of the total graph, then this method may return unexpected results.
Note that the caller is responsible for any required normalisation of the input vector. In particular, this method does not normalise the vector by the node degrees.
g | a stag.graph.LocalGraph object |
v | the vector to sweep over |
float stag.cluster.adjusted_rand_index | ( | List[int] | gt_labels, |
List[int] | labels | ||
) |
Compute the Adjusted Rand Index between two label vectors.
gt_labels | the ground truth labels for the dataset |
labels | the candidate labels whose ARI should be calculated |
float stag.cluster.conductance | ( | graph.LocalGraph | g, |
List[int] | cluster | ||
) |
Compute the conductance of the given cluster in a graph.
Given a graph \(G = (V, E)\), the conductance of \(S \subseteq V\) is defined to be
\[ \phi(S) = \frac{w(S, V \setminus S)}{\mathrm{vol}(S)}, \]
where \(\mathrm{vol}(S) = \sum_{v \in S} \mathrm{deg}(v)\) is the volume of \(S\) and \(w(S, V \setminus S)\) is the total weight of edges crossing the cut between \(S\) and \(V \setminus S\).
g | a stag.graph.LocalGraph object representing \(G\). |
cluster | a list of node IDs in \(S\). |