About
I’m an engineer turned 2x founder who’s spent over a decade building B2B products. Today,…
Articles by Sachin
Activity
-
Thrilled to announce that Savant Labs now powers their inbound pipeline with Breakout! Savant automates data, analytics, and finance workflows with…
Shared by Sachin Gupta
-
The boys are back! Innovaccer is at #JPM2026 discussing all things autonomous healthcare! Our CEO Abhinav Shashank is presenting business updates…
Liked by Sachin Gupta
-
In my mind, the launch of ChatGPT Atlas signified a big shift for marketing. I have three main predictions about what Atlas and Google's quick…
Liked by Sachin Gupta
Experience
Education
-
Indian Institute of Technology, Roorkee
93.9
-
Activities and Societies: National Service Scheme, Program Management Section (Cultural Council), Cognizance (Technical fest)
-
-
-
Publications
-
Efficient Variable Size Template Matching Using Fast Normalized Cross Correlation on Multicore Processors
LNCS Springer
Normalized Cross Correlation (NCC) is an efficient and robust way of finding the location of a template in a given image. However, NCC is computationally expensive. Fast normalized cross correlation (FNCC) uses pre-computed sum-tables to improve the computational efficiency of NCC. In this paper we propose a strategy for a parallel implementation of the FNCC algorithm using NVIDIA’s Compute Unified Device Architecture (CUDA) for real-time template matching. We also present an approach to make the proposed method adaptable to variable-size templates, which is an important challenge to tackle. Efficient parallelization strategies adopted for pre-computing sum-tables and extracting data parallelism by dividing the image into a series of blocks substantially reduce the required computational time. We show that by optimal utilization of the different memories available on CUDA, and by using the idle time of the host CPU to perform independent tasks, we can obtain a speedup on the order of 17X compared to the sequential implementation.
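As a rough illustration of the sum-table idea mentioned in the abstract, here is a minimal sequential sketch in pure Python. It is not the paper's CUDA implementation: the function names (`sum_table`, `window_sum`, `ncc_map`, `best_match`) are hypothetical, and the numerator is computed directly for simplicity. The sum-tables are used only for the per-window normalization term, which is where they let each window's sum and sum of squares be read in constant time.

```python
import math

def sum_table(img):
    """Summed-area table with a zero border: s[y][x] = sum of img[0:y][0:x]."""
    h, w = len(img), len(img[0])
    s = [[0.0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row = 0.0
        for x in range(w):
            row += img[y][x]
            s[y + 1][x + 1] = s[y][x + 1] + row
    return s

def window_sum(s, y, x, th, tw):
    """Sum over the th-by-tw window whose top-left corner is (y, x)."""
    return s[y + th][x + tw] - s[y][x + tw] - s[y + th][x] + s[y][x]

def ncc_map(img, tpl):
    """NCC score for every placement of tpl inside img."""
    h, w = len(img), len(img[0])
    th, tw = len(tpl), len(tpl[0])
    n = th * tw
    t_mean = sum(map(sum, tpl)) / n
    t0 = [[v - t_mean for v in row] for row in tpl]  # zero-mean template
    t_norm = math.sqrt(sum(v * v for row in t0 for v in row))
    s1 = sum_table(img)                                  # table of pixel values
    s2 = sum_table([[v * v for v in row] for row in img])  # table of squares
    out = []
    for y in range(h - th + 1):
        out_row = []
        for x in range(w - tw + 1):
            # Numerator computed directly; the window mean drops out because
            # the zero-mean template sums to zero.
            num = sum(img[y + j][x + i] * t0[j][i]
                      for j in range(th) for i in range(tw))
            # Window energy about its mean, read from the two sum-tables.
            f_sum = window_sum(s1, y, x, th, tw)
            f_sq = window_sum(s2, y, x, th, tw)
            f_var = max(f_sq - f_sum * f_sum / n, 0.0)
            denom = math.sqrt(f_var) * t_norm
            out_row.append(num / denom if denom > 0 else 0.0)
        out.append(out_row)
    return out

def best_match(img, tpl):
    """Top-left (y, x) of the highest-scoring placement."""
    scores = ncc_map(img, tpl)
    return max((v, (y, x))
               for y, row in enumerate(scores)
               for x, v in enumerate(row))[1]
```

Because the tables depend only on the image, the same two tables serve templates of any size, which is one plausible reason the approach suits variable-size templates.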
-
Motion Detection in Low Resolution Grayscale Videos Using Fast Normalized Cross Correlation on GP-GPU
ICAISC, Bhuvaneshwar
Motion estimation (ME) has been widely used in many computer vision applications, such as object tracking, object detection, pattern recognition and video compression. The most popular block-based similarity measures are the sum of absolute differences (SAD), the sum of squared differences (SSD) and the normalized cross correlation (NCC). The similarity measure obtained using NCC is more robust under varying illumination than SAD and SSD. However, NCC is computationally expensive, and applying NCC with a full (exhaustive) search further increases the required computational time. A relatively efficient way of calculating the NCC is to pre-compute sum-tables to perform the normalization, referred to as fast NCC (FCC). In this paper we propose a real-time implementation of the full-search FCC algorithm applied to grayscale videos using NVIDIA’s Compute Unified Device Architecture (CUDA). We present fine-grained optimization techniques for fully exploiting the computational capacity of CUDA. Novel parallelization strategies adopted for extracting data parallelism substantially reduce the computational time of exhaustive FCC. We show that by efficient utilization of the global, shared and texture memories available on CUDA, we can obtain a speedup on the order of 10x compared to the sequential implementation of FCC.
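To make the robustness claim in the abstract concrete, the following small Python sketch compares the three block similarity measures on a toy example. The block values, the gain and offset used to simulate an illumination change, and the `distractor` block are all made up for illustration; none of them come from the paper.

```python
import math

def sad(a, b):
    """Sum of absolute differences (lower means more similar)."""
    return sum(abs(p - q) for p, q in zip(a, b))

def ssd(a, b):
    """Sum of squared differences (lower means more similar)."""
    return sum((p - q) ** 2 for p, q in zip(a, b))

def ncc(a, b):
    """Zero-mean normalized cross correlation (higher means more similar)."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    da = [p - ma for p in a]
    db = [q - mb for q in b]
    denom = math.sqrt(sum(p * p for p in da) * sum(q * q for q in db))
    return sum(p * q for p, q in zip(da, db)) / denom if denom else 0.0

# A flattened 2x2 block from a reference frame (hypothetical values).
block = [10, 50, 30, 80]
# The same content in the next frame under brighter lighting (gain 2, offset 20).
brighter = [2 * v + 20 for v in block]
# An unrelated block whose raw intensities happen to be closer to the reference.
distractor = [40, 40, 40, 60]

for name, cand in (("brighter", brighter), ("distractor", distractor)):
    print(name, sad(block, cand), ssd(block, cand), round(ncc(block, cand), 3))
```

In this toy case SAD and SSD both rank the unrelated block as the closer match, while NCC still scores the re-lit block at essentially 1, since subtracting the mean and dividing by the norm cancels the offset and the gain.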
Courses
-
Compilers
-
-
Database Management Systems
-
-
Operating System
-
Honors & Awards
-
Forbes 30 under 30
Forbes
Awarded as Forbes 30 under 30 in the Enterprise Tech category for Asia.
-
Forbes 30 under 30
Forbes
Recognized in Forbes 30 under 30 for Enterprise software.
Languages
-
English
Native or bilingual proficiency
-
Hindi
Native or bilingual proficiency
More activity by Sachin
-
Today I start a new chapter in my career. I’m heading to Florida for Syncro’s revenue kick off, and with it, I’m officially stepping into my new…
Liked by Sachin Gupta
-
Today, I’m officially joining Manhattan Associates as Chief Marketing Officer. I’m motivated by the chance to build a resilient, modern marketing…
Liked by Sachin Gupta
-
Aircall why you no offer LLM-powered SDR 🤔 we recently hired a SDR, hence looking for a calling solution. For some reason Aircall was top of mind…
Shared by Sachin Gupta
-
🧠 EVALS ARE A COMPETITIVE MOAT 🧠 some thoughts on ai evals: - evals are a competitive moat. robust evals -> ability to dial in performance of…
Liked by Sachin Gupta
-
We’re excited to welcome Doug Pullman as Pebl’s new Chief Marketing Officer. With over a decade of experience scaling marketing across SaaS and…
Liked by Sachin Gupta