WatCon: A Python Tool for Analysis of Conserved Water Networks Across Protein Families

18 April 2025, Version 2
This content is a preprint and has not undergone peer review at the time of posting.

Abstract

Water structure is crucially important to protein function and catalysis and can be conserved throughout related proteins despite differences in sequence. The complex hydrogen bonding networks formed by water molecules and protein residues has been studied extensively, with graph theory-based methods frequently used to describe these networks. Although there exist a number of tools which can be used to track water positions and networks, corresponding methods for easily analyzing complex water network structure across related proteins are limited. To address this challenge, we present here a new tool, WatCon, an open-source Python package which can be used to analyze water positions and water network structure across protein families, using both dynamic and static structural information. Importantly, WatCon can be used to classify conservation of water networks, characterize water networks across structures, and project subsequent results for easy visual interpretation. To illustrate WatCon usage, we provide four example applications: a trajectory analysis demonstration using the protein tyrosine phosphatase (PTP1B), a demonstration of static and dynamic information comparison using PTP1B, a demonstration of a cross-protein family analysis using classical non-receptor type PTPs, and a family analysis using triosephosphate isomerase (TPI). This in turn showcases the utility of WatCon for enhancing our understanding of biochemical systems, predicting water hotspots of potential relevance to protein engineering, and predicting pathogenic mutations. WatCon can be downloaded at https://github.com/kamerlinlab/WatCon, and is available under the GNU General Public License v3.0.

Keywords

Water Networks
Graph Theory
Protein Tyrosine Phosphatases
Protein Structure
Water Conservation

Supplementary materials

Title
Description
Actions
Title
Supporting Information
Description
Additional Simulation Data and Analysis
Actions

Supplementary weblinks

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting Policy [opens in a new tab] - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here [opens in a new tab] .
This site is protected by reCAPTCHA and the Google Privacy Policy [opens in a new tab] and Terms of Service [opens in a new tab] apply.