Sachin Gupta

Sachin Gupta

San Jose, California, United States
25K followers 500+ connections

About

I’m an engineer turned 2x founder who’s spent over a decade building B2B products. Today,…

Articles by Sachin

Activity

Join now to see all activity

Experience

  • Breakout Graphic

    Breakout

    San Jose, California, United States

  • -

    San Francisco Bay Area

  • -

    San Francisco Bay Area

  • -

    India

  • -

    Hyderabad Area, India

Education

Publications

  • Efficient Variable Size Template matching Using Fast Normalized Cross Correlation on Multicore Processors

    LNCS Springer

    Normalized Cross Correlation (NCC) is an efficient and robust way for finding the location of a template in given image. However NCC is computationally expensive. Fast normalized cross correlation (FNCC) makes use of pre-computed sum-tables to improve the computational efficiency of NCC. In this paper we propose a strategy for parallel implementation of FNCC algorithm using NVIDIA’s Compute Unified Device Architecture (CUDA) for real-time template matching. We also present an approach to make…

    Normalized Cross Correlation (NCC) is an efficient and robust way for finding the location of a template in given image. However NCC is computationally expensive. Fast normalized cross correlation (FNCC) makes use of pre-computed sum-tables to improve the computational efficiency of NCC. In this paper we propose a strategy for parallel implementation of FNCC algorithm using NVIDIA’s Compute Unified Device Architecture (CUDA) for real-time template matching. We also present an approach to make proposed method adaptable to variable size templates which is an important challenge to tackle. Efficient parallelization strategies adopted for pre-computing sum-tables and extracting data parallelism by dividing the image into series of blocks substantially reduce required computational time. We show that by optimal utilization different memories available on CUDA and using idling time of host CPU to perform independent tasks we can obtain the speedup of the order of 17X as compared to the sequential implementation.

    Other authors
    See publication
  • Motion Detection in Low Resolution Grayscale Videos Using Fast Normalized Cross Correrelation on GP-GPU

    ICAISC, Bhuvaneshwar

    Motion estimation (ME) has been widely used in many computer vision applications, such as object tracking, object detection, pattern recognition and video compression. The most popular block based similarity measures are the sum of absolute differences (SAD), the sum of squared differences (SSD) and the normalized cross correlation (NCC). Similarity measure obtained using NCC is more robust under varying illumination changes as compared to SAD and SSD. However NCC is computationally expensive…

    Motion estimation (ME) has been widely used in many computer vision applications, such as object tracking, object detection, pattern recognition and video compression. The most popular block based similarity measures are the sum of absolute differences (SAD), the sum of squared differences (SSD) and the normalized cross correlation (NCC). Similarity measure obtained using NCC is more robust under varying illumination changes as compared to SAD and SSD. However NCC is computationally expensive and application of NCC using full or exhaustive search method further increases required computational time. Relatively efficient way of calculating the NCC is to pre-compute sum-tables to perform the normalization referred to as fast NCC (FCC). In this paper we propose real time implementation of full search FCC algorithm applied to gray scale videos using NVIDIA’s Compute Unified Device Architecture (CUDA). We present fine-grained optimization techniques for fully exploiting computational capacity of CUDA. Novel parallelization strategies adopted for extracting data parallelism substantially reduce computational time of exhaustive FCC. We show that by efficient utilization of global, shared and texture memories available on CUDA, we can obtain the speedup of the order of 10x as compared to the sequential implementation of FCC.

    Other authors
    See publication

Courses

  • Compilers

    -

  • Database Management Systems

    -

  • Operating System

    -

  • Operating System

    -

Honors & Awards

  • Forbes 30 under 30

    Forbes

    Awarded as Forbes 30 under 30 in the Enterprise Tech category for Asia.

  • Forbes 30 under 30

    Forbes

    Recognized in Forbes 30 under 30 for Enterprise software.

Languages

  • English

    Native or bilingual proficiency

  • Hindi

    Native or bilingual proficiency

More activity by Sachin

View Sachin’s full profile

  • See who you know in common
  • Get introduced
  • Contact Sachin directly
Join to view full profile

Other similar profiles

Explore top content on LinkedIn

Find curated posts and insights for relevant topics all in one place.

View top content

Add new skills with these courses