Welcome to DRIVE’s documentation!#
Distant Relatedness for Identification and Variant Evaluation (DRIVE) is a novel approach to IBD-based genotype inference used to identify shared chromosomal segments in dense genetic arrays. DRIVE utilizes a community walktrap algorithm that identifies clusters of individuals who share a pairwise IBD segment overlapping a locus of interest. This tool was written in Python and developed by the Below Lab at Vanderbilt University.
Quick Installation:#
Installing DRIVE: The easiest way to install DRIVE is from the PyPI register. Users can use the following command to install the program. This method ensures that all the necessary dependencies are installed. This method assumes you have either made a virtualenv (venv) or a conda environment to install the program into. To read more about this install method read the Pip Installation section.
pip install drive-ibd
Important
DRIVE supports Python versions >=3.10 (but not Python version 3.11.0) for compatibility between packages. It is expected that your Python version falls within this range when attempting to install the software. You can check the Python version using the ‘python –version’ command. If the version number is outside of the acceptable range then you must install an appropriate version from sources such as Python.org, or a package manager such as Homebrew on MacOS, Conda, or the appropriate Linux package manage if you are running Linux. Additionally, DRIVE doesn’t support the development version of Python 3.13t that allows users to disable the GIL due to some packages not being fully compatible yet.
Check Version: Once you have installed DRIVE you should check the version to ensure that you have the most recent code. At the time of writing the newest version is 3.1.0 but you can also check the PYPI Badge at the top of the GitHub README.md. This badge will display the most recent version on PYPI. This check is important because the testing framework was not added to DRIVE until v3 so running the test with an older version of DRIVE will result in an error (see FAQ).
drive --version
Run tests to check successful installation:#
Once you have ensured that the correct version is installed then you can run the integration tests using the following command.
drive utilities test
These tests use simulated IBD data to check that DRIVE is identifying the correct networks and is creating the appropriate output files. Genotype data was then simulated using MSPrime, data was phased using SHAPEIT4, and pairwise IBD segments were detected using hap-IBD. You can read more about the simulations under the section called “Simulating IBD Data” here.
Warning
Users are more than welcome to repeat the “Simulating IBD Data” pipeline and use another IBD calling tool such as iLASH or GERMLINE. Commands to rerun this pipeline are under the “Simulating IBD Data” section. Each of these programs will have differences in the detected pairwise IBD segments which may affect the networks identified by DRIVE. Because of these innate differences in the IBD detection tools, we make no guarantee that the output will exactly match the output file, “test_drive_phenomewide_output.drive_networks.txt”, which can be found in the tests/test_output directory at the GitHub repository.
Citation:#
The following paper discusses an initial implementation of the clustering algorithm of DRIVE (1.0.0) and can be cited if you use the tool: Detection of distant relatedness in biobanks to identify undiagnosed cases of Mendelian disease as applied to Long QT Syndrome.
Contact:#
If you have any questions about DRIVE or run into issues, you can either post an issue on the GitHub issues page or you can contact us at the email address, insert email here.