Subgraphs¶

A SubGraph is a pipeline that behaves as a single node. You build it like a Graph, add nodes, wire edges, but it is constrained to exactly one input node and one output node, so it has the same one-in/one-out shape as an ordinary step and can be placed on an edge in an outer graph.

        flowchart LR
    IN(["input"]) --> SG
    subgraph SG["SubGraph"]
        direction LR
        A["tool A"] --> B["tool B"]
    end
    SG --> OUT["next node"]

Constraints¶

The single-in/single-out rule is enforced at build time: add_input_node raises if called twice, and set_output_node raises if called twice. The subgraph exposes get_input_node() and get_output_node() so the outer graph can attach edges to its endpoints, you pass the subgraph object to add_edge and biocomposer resolves the correct inner node automatically.

Use in an outer graph¶

from biocomposer import Graph, SubGraph

pre = SubGraph()
s_in  = pre.add_input_node(sequences="/vol/inputs/family.fasta")
align = pre.add_node("clustalo")
trim  = pre.add_node("trimal",
                     args_override="-in {alignment} -out {trimmed} -fasta -gappyout")
pre.add_edge((s_in, align), (align, trim))
pre.set_output_node(trim)

g = Graph()
tree = g.add_node("fasttree")
g.add_edge((pre, tree))      # subgraph's output feeds the next tool
g.set_output_node(tree)
g.execute()

Run standalone¶

A subgraph can also be executed directly with fresh inputs via run(inputs), which sets its single input node and runs:

pre.run({"sequences": "/vol/inputs/other_family.fasta"})

Reference¶

class biocomposer.SubGraph(inNodes: list = None)

Bases: Graph

add_input_node(**kwargs) → InputNode

get_input_node() → InputNode

get_output_node() → Node

run(inputs: dict)

set_output_node(node: Node = None)