PI_logo

Physicochemical n-Grams Tool (PnGT) Tutorial

Introduction


Physicochemical n-Grams Tool is an open source standalone software for calculating physicochemical descriptors of protein developed using the Python scripting language.

The algorithms used in the development of this tool are based on the n-Grams curation techniques which are mainly used for natural language processing. Apart from language processing, now a days n-Grams modelling is also making a room in the computational biology and genomics.

The software currently calculates 33 physicochemical descriptors and the sequence length for given protein primary sequence. To find more specific information about our tool, please contact us.

Figure.1 Physicochemical Amino Acids Property Groups

Utility: The tool is able to assess the physicochemical properties of the protein in an easier way with bi-Grams, tri-Grams, tetra-Grams assessment of protein primary sequence.

Assessment of the physicochemical n-Grams properties of the protein: With the help of this PnGT tool, the analysis of proteomics data in the term of physicochemical n-Grams statistics is easy-to-do with the only primary sequence as feeding.

Implementation in machine learning model development as feeding: The outcomes in terms of physicochemical feature vectors from the Physicochemical n-Grams Tool can be directly implemented for the development of statistical (PCA, Regression Analysis) or machine learning model like SVM, kNN etc.

Keywords: Proteomics science, physicochemical parameter, n-Grams, machine learning, statistical modelling.

Physicochemical n-Grams Descriptors and Calculations: Conservation of multiple physicochemical groups along the sequence in a sliding window of length n residue. For example, physicochemical 2-Grams: Aliphatic++; where n equals to 2

if〖aa〗i ∈S* AND 〖aa〗(i+1)∈S*

where, aai = ith residue of the protein

Installation Steps for User


1. PnGT-Installer: To download “PnGT-Installer” go to Download tab and download PnGT-Installer to install Physicochemical n-Grams Tool.


2. Extract the zip file and open it, then locate “PnGT” as shown in Figure 2 below.

Figure.2 PnGT Installer

Home Page


1. The PnGT will open as shown in Figure 3 below.

Figure.3 PnGT Home Page

Physicochemical n-Grams Descriptors Calculation