A collection of toy command-line tools written in Swift for bioinformatic data wrangling.
Each tool is a subcommand of biosw.


Command line

  1. Convert GTF to BED format
biosw gtf2bed gencode.gtf out.bed
  1. Convert VCF to HBCGM (eblocks) format
biosw hbcgm snp.vcf out.txt
  1. More tools will be added if they prove useful.
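
For readers unfamiliar with the two formats, the core of a GTF to BED conversion is a coordinate shift: GTF intervals are 1-based and end-inclusive, while BED intervals are 0-based and end-exclusive. The sketch below illustrates that step only. It is not the code used by `biosw gtf2bed`; the function name `bedLine(fromGTF:)`, the choice of the feature column as the BED name, and the mapping of a missing score to `0` are all assumptions made for the example.

```swift
import Foundation

/// Hypothetical helper (not part of Bio): converts one GTF record to a BED6 line.
/// Returns nil for comment lines and rows that do not parse.
func bedLine(fromGTF line: String) -> String? {
    guard !line.hasPrefix("#") else { return nil }

    // GTF has 9 tab-separated columns:
    // seqname, source, feature, start, end, score, strand, frame, attributes
    let fields = line.split(separator: "\t", omittingEmptySubsequences: false)
    guard fields.count >= 9,
          let start = Int(fields[3]),
          let end = Int(fields[4]),
          start >= 1 else { return nil }

    // Assumption: use the feature type as the BED name and "0" for a missing score.
    let name = String(fields[2])
    let score = fields[5] == "." ? "0" : String(fields[5])

    // GTF start is 1-based, BED start is 0-based: subtract 1.
    // GTF end is inclusive, BED end is exclusive: the number stays the same.
    let bed = [String(fields[0]), String(start - 1), String(end),
               name, score, String(fields[6])]
    return bed.joined(separator: "\t")
}

// Synthetic record, for illustration only.
let record = "chr1\ttest\texon\t101\t200\t.\t+\t.\tgene_id \"g1\";"
print(bedLine(fromGTF: record) ?? "skipped")
```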

Swift REPL

As a library

import Bio
let testDataPath = "data"
let gtf = GTF(testDataPath+"/test.gtf")
gtf.toBed(filename: testDataPath+"/test.bed")



cd swift-bio
swift build
# build output in .build/debug

# check that the build succeeded
swift run biosw


# binary output in .build/release
swift build --configuration release


Use Bio as a dependency for your own package

To depend on the Bio API package, you need to declare your dependency in your Package.swift:

.package(url: "https://github.com/zqfang/swift-bio", from: "0.0.1"),

Then add Bio to the dependencies of your application or library target, for example:

.target(name: "YourApp", dependencies: ["Bio"]),
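
Put together, a minimal manifest might look like the sketch below. The package name `YourApp` and the tools version are assumptions. With tools version 5.2 or later, the target dependency may need to be written as `.product(name: "Bio", package: "swift-bio")` instead of the plain string.

```swift
// swift-tools-version:5.1
import PackageDescription

let package = Package(
    name: "YourApp", // hypothetical package name
    dependencies: [
        // The Bio library lives in the swift-bio repository.
        .package(url: "https://github.com/zqfang/swift-bio", from: "0.0.1"),
    ],
    targets: [
        // Your own target, linking against the Bio library.
        .target(name: "YourApp", dependencies: ["Bio"]),
    ]
)
```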


1. File I/O in Swift is quite different from C/C++, Python, and similar languages

Reading large text files is slow in Swift with code like this:

import Foundation
let path = Bundle.main.path(forResource:"test", ofType: "txt")
let lines = try? String(contentsOfFile: path!)
                     .split{$0 == "\n"}

NOTE: String reads the whole file into memory at once instead of line by line, so large files take a long time to load. Reading a file line by line is much more efficient, but you have to write your own parser.
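
One way to read line by line is to call the C library's `fopen` and `getline` through Foundation. The sketch below is a minimal illustration under that assumption, not the reader used inside Bio. The helper name `forEachLine(inFile:_:)` is hypothetical.

```swift
import Foundation

/// Hypothetical helper (not part of Bio): calls `body` once per line of the
/// file at `path` without loading the whole file into memory.
/// Returns false if the file cannot be opened.
@discardableResult
func forEachLine(inFile path: String, _ body: (String) -> Void) -> Bool {
    guard let file = fopen(path, "r") else { return false }
    defer { fclose(file) }

    // getline allocates and grows this buffer as needed; we free it once at the end.
    var buffer: UnsafeMutablePointer<CChar>? = nil
    var capacity = 0
    defer { free(buffer) }

    while true {
        let length = getline(&buffer, &capacity, file)
        // getline returns -1 at end of file or on error.
        guard length > 0, let lineStart = buffer else { break }

        var line = String(cString: lineStart)
        // Drop the trailing newline ("\n" or "\r\n") if present.
        if line.last?.isNewline == true { line.removeLast() }
        body(line)
    }
    return true
}
```

A caller would typically split each line on tabs inside the closure, for example `forEachLine(inFile: "data/test.gtf") { line in print(line.split(separator: "\t").count) }`. Memory use stays proportional to the longest line rather than to the file size.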

2. An experimental 2D Array with Numpy-like indexing and slicing

Array2D is for testing purposes only. An example in Swift:

init and assign values

import Bio
// init 2d array
var arr = Array2D<Int>(rows:10, columns:5, initialValue: 0 )

// init an Array2D from another Array2D instance
let arr2 = Array2D<Int>(arr)

// init an Array2D from a generic 2D array
let arr3 = Array2D<Double>([[1.0,2.0],[3.0,4.0]])

// assign value
arr[1,4] = 1 
arr[5,2] = 6

print(arr[5]) // [0, 0, 6, 0, 0]

Numpy-like indexing and slicing

[[0, 0, 0, 0, 1], 
 [0, 0, 0, 0, 0]]

// [0, 0, 0, 0, 0]

print(arr[1..<6, 3..<5]) 
[[0, 1], 
 [0, 0], 
 [0, 0], 
 [0, 0], 
 [0, 0]]
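
To show how this kind of indexing can be built, here is a minimal sketch of a row-major 2D container with overloaded subscripts. The type `Grid` is a hypothetical stand-in written for this example. It is not the Array2D implementation in Bio, and it omits bounds checking.

```swift
/// Hypothetical stand-in for a 2D array (not Bio's Array2D).
/// Elements are stored row by row in one flat array.
struct Grid<Element> {
    let rows: Int
    let columns: Int
    private var storage: [Element]

    init(rows: Int, columns: Int, initialValue: Element) {
        self.rows = rows
        self.columns = columns
        storage = Array(repeating: initialValue, count: rows * columns)
    }

    /// Single element: grid[row, column]
    subscript(row: Int, column: Int) -> Element {
        get { storage[row * columns + column] }
        set { storage[row * columns + column] = newValue }
    }

    /// Whole row: grid[row]
    subscript(row: Int) -> [Element] {
        Array(storage[row * columns ..< (row + 1) * columns])
    }

    /// Rectangular block: grid[rowRange, columnRange]
    subscript(rowRange: Range<Int>, columnRange: Range<Int>) -> [[Element]] {
        rowRange.map { r in columnRange.map { c in self[r, c] } }
    }
}

var grid = Grid(rows: 10, columns: 5, initialValue: 0)
grid[1, 4] = 1
grid[5, 2] = 6
print(grid[5])              // [0, 0, 6, 0, 0]
print(grid[1..<3, 3..<5])   // [[0, 1], [0, 0]]
```

Swift picks the subscript overload from the argument types, so `Int` arguments select a single element or row and `Range<Int>` arguments select a block.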

Generate a template Swift package called BioSwift

mkdir BioSwift && cd BioSwift
swift package init --type executable
swift package generate-xcodeproj # generate an Xcode project