Converts a text file of lines "<id1><whitespace><id2>" into a binary memory mapped graph.
Converts a text file of lines "<id1><whitespace><id2>" into a binary memory mapped graph. Any duplicate edges are not included in the output. Empty lines are ignored. Progress is logged using the given function. See the comment in MemoryMappedDirectedGraph for the interpretation of segmentCount (the default value should work well).
The convert method converts a graph from a text file of unsorted edges to a binary file which can be efficiently read using MemoryMappedDirectedGraph. This can also be run from the command line; for example: sbt assembly printf "1 2\n3 4\n1 3" > input_graph.txt java -Xmx7g -cp target/scala-2.11/bidirectional-random-walk-assembly-1.0.jar\ co.teapot.tempest.graph.MemoryMappedDirectedGraphConverter input_graph.txt output_graph.dat
Then in scala code, MemoryMappedDirectedGraph("output_graph.dat") will efficiently read the graph.