SlideShare una empresa de Scribd logo
1 de 24
Descargar para leer sin conexión
Lifting Scheme Cores for Wavelet Transform
David Barina
(supervised by Pavel Zemcik)
1 / 24
DWT in image processing
can be found in many image-processing tasks
analysis
(edge detection, feature extraction, multiscale representation),
compression (JPEG 2000, Dirac),
watermarking, edge sharpening, contrast enhancement,
tone mapping, denoising, fusion, etc.
2 / 24
Filter bank
S. Mallat, ”A theory for multiresolution signal decomposition: The wavelet representation” (1989)
˜H(z−1
) a
d
↓ 2
+
˜G(z−1
) ↓ 2
↑ 2 H(z)
↑ 2 G(z)
decomposition: two complementary filters,
high number of operations
3 / 24
Lifting scheme
I. Daubechies, W. Sweldens, ”Factoring wavelet transforms into lifting steps” (1998)
a
d
split ˜P(z−1
)T
P(z) merge
P(z) =
I−1
i=0
1 Si(z)
0 1
1 0
Ti(z) 1
K 0
0 1/K
decomposition: sequence of simple filtering steps,
reduces the number of operations, split: even, odd
4 / 24
CDF 9/7 wavelet
I. Daubechies, W. Sweldens, ”Factoring wavelet transforms into lifting steps” (1998)
α
β
γ
δ
input
output
steps
even samples
odd samples
˜P(z) =
1 α 1 + z−1
0 1
1 0
β (1 + z) 1
1 γ 1 + z−1
0 1
1 0
δ (1 + z) 1
ζ 0
0 1/ζ
four two-tap symmetric filters
5 / 24
2-D decomposition
S. Mallat, ”A theory for multiresolution signal decomposition: The wavelet representation” (1989)
a h
v d
horizontal vertical
h
v d
a h
dv
image: 2-D signal, by a series of 1-D transforms, four subbands,
multi-scale decomposition
6 / 24
Lenna
how to calculate this as efficiently as possible
7 / 24
Strategies and issues
R. Kutil, ”A single-loop approach to SIMD parallelization of 2-D wavelet lifting” (2006)
a h
v d
horizontal vertical
strategies row-column, block-based, and line-based
cache issues cache line, limited size, set associativity, prefetching
techniques padding, aggregation, memory layouts,
interleave loops, parallelization
the approaches have to repeatedly visit samples,
memory access is expensive ⇒ CPU cache, limitations,
existing techniques, single-loop approach
8 / 24
Unsolved issues
2 × 2
prolog
core
epilog
prolog epilog
F
F
FF
complicated border treatment (prolog/epilog phases)
suspend/resume processing
arbitrary processing order (scan order)
interleave the transform and a subsequent processing
multi-scale decomposition
reorganization of underlying scheme
9 / 24
Objectives of the thesis
Aims improve image transform performance and resource
consumption
Objectives eliminate the shortcomings of existing methods
previous slide
Evaluation prove experimentally
performance, memory requirements
10 / 24
Lifting core
D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press)
solution: a processing unit
continuously consumes an input and produces an output
which visits every image sample only once (cache friendly)
which is aware of image coordinates (can handle the borders)
whose configuration (state) can be saved/restored
which can be run in any direction
which can be SIMD vectorized
which can run in parallel (on independent parts of the image)
y = C x
x
def
= In B y
def
= On B
11 / 24
Core examples
D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press)
α
β
γ
δ m
n
1 2 3 4
core inputs, outputs
12 / 24
Processing orders
D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press)
horizontal horiz. strips horiz. blocks
vertical vert. strips vert. blocks
13 / 24
Borders treatment
D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press)
d a d a d a d a d a d a d a d a d a d
d a d a d a d a d a d a d a d a d a d a
n n n n n n n
a d aad
n nnnn
d a d a d a d a d a d a d a d a d a d
0
d a d a d a d a d a d a d a d a d a d a
2 n N − 2 N
0 0
n n n n n n
a
y = Cn x
cores gracefully treats the boundaries
14 / 24
Parallel cores and reorganization
M. Kula, D. Barina, et al., ”Block-based Approach to 2-D Wavelet Transform on GPUs” (2016)
1 2 3 4
Sweldens1995
1 2 3
Iwahashi2007
1 2
proposed
15 / 24
3-D core
D. Barina, P. Zemcik, ”Real-Time 3-D Wavelet Lifting” (2015)
x
y
z
buffer x
buffer y
buffer z
extended into more dimensions, buffers on the sides
16 / 24
CPU implementation
D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press)
0.0 s
5.0ns
10.0ns
15.0ns
20.0ns
25.0ns
30.0ns
35.0ns
40.0ns
45.0ns
50.0ns
1.0k 10.0k 100.0k 1.0M 10.0M 100.0M
time/pixel
pixels
separable approach
core approach
an evaluation of approaches,
implemented the separable, single-loop, and core
17 / 24
3-D CPU implementation
D. Barina, P. Zemcik, ”Real-Time 3-D Wavelet Lifting” (2015)
x
y
z
buffer x
buffer y
buffer z
0.0 s
20.0ns
40.0ns
60.0ns
80.0ns
100.0ns
120.0ns
140.0ns
160.0ns
0.0 50.0M 100.0M 150.0M 200.0M 250.0M
time/voxel
voxels
naive horizontal
naive vertical
core 42
core 23
core 43
performance of 3-D transform: separable, 2-D core, 3-D core
18 / 24
GPU implementation
M. Kula, D. Barina, et al., ”Block-based Approach to 2-D Wavelet Transform on GPUs” (2016)
80.0
100.0
120.0
140.0
160.0
180.0
200.0
220.0
240.0
260.0
0.0 10.0M 20.0M 30.0M 40.0M 50.0M 60.0M 70.0M
GB/s
pixels
Kucis2014
Separable Block
Non-Separable Block
0
10
20
30
40
50
60
100kpel 1Mpel 10Mpel 100Mpel
GB/s
Sweldens
Iwahashi*
Explosive*
Monolithic*
Polyphase*
Monolithic∗
scheme:
left: SotA is in red, block methods in blue/green, reorganization
right: block methods, separable in black, our in blue/green
19 / 24
FPGA implementation
D. Barina, et al., ”Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility” (2015)
H V
BRAM
Input Transform
core FF LUT BRAM
latency 4 441 (0.1 %) 399 (0.18 %) 6 (1.1 %)
latency 2 391 (< 0.1 %) 592 (0.27 %) 6 (1.1 %)
architecture device BRAM [bits] clocks/pel time [ms]
Dillen2003 VirtexE1000-8 50K 0.50 1.20
Descampe2004 Virtex-II XC2V6000 N/A 0.60 1.75
Seo2007 Altera Stratix 128K 2.64 6.02
Zhang2012 Virtex-II Pro XC2VP30 6 × 18K 0.50 0.97
the cores Zynq XC7Z045 1 × 36K 0.26 0.27
20 / 24
JPEG 2000 implementation
D. Barina, O. Klima, P. Zemcik, ”Single-Loop Architecture for JPEG 2000” (2016)
core
codeblock
2 × 2cn
2 × 2cm
aj
aj+1
h v d
0.0
20.0
40.0
60.0
80.0
100.0
120.0
140.0
100.0k 1.0M 10.0M 100.0M 1.0G
time[ns]
resolution [pel]
proposed
OpenJPEG
JasPer
FFmpeg
21 / 24
Contributions of the thesis
Aims improved image transform performance and resource
consumption
Objectives eliminated the shortcomings of existing methods
Evaluation assessed experimentally
(performance, memory requirements)
evaluation performed:
2-D on CPU, 3-D on CPU, 2-D on GPU, 2-D on FPGA,
JPEG 2000 on CPU
22 / 24
Selected papers
Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Software Architecture for JPEG 2000. In
Data Compression Conference (DCC), 2016
Barina, D.; Musil, M.; Musil, P.; et al.: Single-Loop Approach to 2-D Wavelet Lifting with
JPEG 2000 Compatibility. In Workshop on Applications for MultiCore Architectures
(WAMCA), 2015
Barina, D.; Zemcik, P.: Minimum Memory Vectorisation of Wavelet Lifting. In Advanced
Concepts for Intelligent Vision Systems (ACIVS), 2013
Barina, D.; Zemcik, P.: Wavelet Lifting on Application Specific Vector Processor. In
GraphiCon, 2013
Barina, D.; Zemcik, P.: Diagonal Vectorisation of 2-D Wavelet Lifting. In IEEE International
Conference on Image Processing (ICIP), 2014
Barina, D.; Zemcik, P.: Real-Time 3-D Wavelet Lifting. In International Conference in
Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2015
Barina, D.; Zemcik, P.: Vectorization and parallelization of 2-D wavelet lifting. Journal of
Real-Time Image Processing (JRTIP), in press
Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Architecture for JPEG 2000. In: Image and
Signal Processing (ICISP), 2016
Kula, M.; Barina, D.; Zemcik, P.: Block-based Approach to 2-D Wavelet Transform on GPUs.
In International Conference on Information Technology – New Generations (ITNG), 2016
Kucis, M.; Barina, D.; Kula, M.; et al.: 2-D Discrete Wavelet Transform Using GPU. In
Workshop on Application for Multi-Core Architectures (WAMCA), 2014
23 / 24
Summary
the core
computing unit which processes the data in a single pass,
can suspend/resume execution,
can processes the data in many different orders,
can handle signal boundaries (is aware of coordinates),
can be easily SIMD vectorized and parallelized,
and whose underlying scheme can be reorganized.
24 / 24

Más contenido relacionado

La actualidad más candente

Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Editor IJARCET
 
Discrete cosine transform
Discrete cosine transformDiscrete cosine transform
Discrete cosine transformaniruddh Tyagi
 
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...ijsrd.com
 
Iaetsd wavelet transform based latency optimized image compression for
Iaetsd wavelet transform based latency optimized image compression forIaetsd wavelet transform based latency optimized image compression for
Iaetsd wavelet transform based latency optimized image compression forIaetsd Iaetsd
 
Wavelet video processing tecnology
Wavelet video processing tecnologyWavelet video processing tecnology
Wavelet video processing tecnologyPrashant Madnavat
 
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency Band
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency BandDWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency Band
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency BandIOSR Journals
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compressionsipij
 
Image compression using discrete cosine transform
Image compression using discrete cosine transformImage compression using discrete cosine transform
Image compression using discrete cosine transformmanoj kumar
 
Image Denoising Techniques Preserving Edges
Image Denoising Techniques Preserving EdgesImage Denoising Techniques Preserving Edges
Image Denoising Techniques Preserving EdgesIDES Editor
 
DCT based Image Watermarking
DCT based Image WatermarkingDCT based Image Watermarking
DCT based Image WatermarkingNisarg Shah
 
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...IJSRD
 

La actualidad más candente (19)

Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154Volume 2-issue-6-2148-2154
Volume 2-issue-6-2148-2154
 
Discrete cosine transform
Discrete cosine transformDiscrete cosine transform
Discrete cosine transform
 
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...
Comparative Analysis of Dwt, Reduced Wavelet Transform, Complex Wavelet Trans...
 
DCT
DCTDCT
DCT
 
Iaetsd wavelet transform based latency optimized image compression for
Iaetsd wavelet transform based latency optimized image compression forIaetsd wavelet transform based latency optimized image compression for
Iaetsd wavelet transform based latency optimized image compression for
 
Wavelet video processing tecnology
Wavelet video processing tecnologyWavelet video processing tecnology
Wavelet video processing tecnology
 
145 153
145 153145 153
145 153
 
Dk33669673
Dk33669673Dk33669673
Dk33669673
 
B042107012
B042107012B042107012
B042107012
 
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency Band
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency BandDWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency Band
DWT-DCT-SVD Based Semi Blind Image Watermarking Using Middle Frequency Band
 
Watermarking
WatermarkingWatermarking
Watermarking
 
Ad24210214
Ad24210214Ad24210214
Ad24210214
 
H0545156
H0545156H0545156
H0545156
 
High Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image CompressionHigh Speed and Area Efficient 2D DWT Processor Based Image Compression
High Speed and Area Efficient 2D DWT Processor Based Image Compression
 
Image compression using discrete cosine transform
Image compression using discrete cosine transformImage compression using discrete cosine transform
Image compression using discrete cosine transform
 
Cb34474478
Cb34474478Cb34474478
Cb34474478
 
Image Denoising Techniques Preserving Edges
Image Denoising Techniques Preserving EdgesImage Denoising Techniques Preserving Edges
Image Denoising Techniques Preserving Edges
 
DCT based Image Watermarking
DCT based Image WatermarkingDCT based Image Watermarking
DCT based Image Watermarking
 
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
Implementation of Vedic Multiplier in Image Compression Using Discrete Wavele...
 

Destacado

Lossless image compression via by lifting scheme
Lossless image compression via by lifting schemeLossless image compression via by lifting scheme
Lossless image compression via by lifting schemeSubhashini Subramanian
 
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...idescitation
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingDavid Bařina
 
SVD and Lifting Wavelet Based Fragile Image Watermarking
SVD and Lifting Wavelet Based Fragile Image WatermarkingSVD and Lifting Wavelet Based Fragile Image Watermarking
SVD and Lifting Wavelet Based Fragile Image WatermarkingIDES Editor
 

Destacado (6)

Lossless image compression via by lifting scheme
Lossless image compression via by lifting schemeLossless image compression via by lifting scheme
Lossless image compression via by lifting scheme
 
Wavelets @ CPU
Wavelets @ CPUWavelets @ CPU
Wavelets @ CPU
 
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...
Modified Adaptive Lifting Structure Of CDF 9/7 Wavelet With Spiht For Lossy I...
 
Real-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet LiftingReal-Time 3-D Wavelet Lifting
Real-Time 3-D Wavelet Lifting
 
SVD and Lifting Wavelet Based Fragile Image Watermarking
SVD and Lifting Wavelet Based Fragile Image WatermarkingSVD and Lifting Wavelet Based Fragile Image Watermarking
SVD and Lifting Wavelet Based Fragile Image Watermarking
 
Lossless
LosslessLossless
Lossless
 

Similar a Lifting Scheme Cores for Wavelet Transform

Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...idescitation
 
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijripublishers Ijri
 
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijripublishers Ijri
 
IRJET- Digital Watermarking using Integration of DWT & SVD Techniques
IRJET- Digital Watermarking using Integration of DWT & SVD TechniquesIRJET- Digital Watermarking using Integration of DWT & SVD Techniques
IRJET- Digital Watermarking using Integration of DWT & SVD TechniquesIRJET Journal
 
Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...Tuan Q. Pham
 
nternational Journal of Computational Engineering Research(IJCER)
nternational Journal of Computational Engineering Research(IJCER)nternational Journal of Computational Engineering Research(IJCER)
nternational Journal of Computational Engineering Research(IJCER)ijceronline
 
Comparison of SVD & Pseudo Random Sequence based methods of Image Watermarking
Comparison of SVD & Pseudo Random Sequence based methods of Image WatermarkingComparison of SVD & Pseudo Random Sequence based methods of Image Watermarking
Comparison of SVD & Pseudo Random Sequence based methods of Image Watermarkingijsrd.com
 
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGES
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGESEFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGES
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGESijcnac
 
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...IJECEIAES
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMScsandit
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMScsandit
 
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...Ravi Kiran B.
 
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...INFOGAIN PUBLICATION
 
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...INFOGAIN PUBLICATION
 
Improved anti-noise attack ability of image encryption algorithm using de-noi...
Improved anti-noise attack ability of image encryption algorithm using de-noi...Improved anti-noise attack ability of image encryption algorithm using de-noi...
Improved anti-noise attack ability of image encryption algorithm using de-noi...TELKOMNIKA JOURNAL
 
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...VLSICS Design
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMScscpconf
 

Similar a Lifting Scheme Cores for Wavelet Transform (20)

Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
Highly Parallel Pipelined VLSI Implementation of Lifting Based 2D Discrete Wa...
 
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
 
Image denoising using curvelet transform
Image denoising using curvelet transformImage denoising using curvelet transform
Image denoising using curvelet transform
 
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
Ijri ece-01-02 image enhancement aided denoising using dual tree complex wave...
 
IRJET- Digital Watermarking using Integration of DWT & SVD Techniques
IRJET- Digital Watermarking using Integration of DWT & SVD TechniquesIRJET- Digital Watermarking using Integration of DWT & SVD Techniques
IRJET- Digital Watermarking using Integration of DWT & SVD Techniques
 
Ijetr011837
Ijetr011837Ijetr011837
Ijetr011837
 
Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...Parallel implementation of geodesic distance transform with application in su...
Parallel implementation of geodesic distance transform with application in su...
 
nternational Journal of Computational Engineering Research(IJCER)
nternational Journal of Computational Engineering Research(IJCER)nternational Journal of Computational Engineering Research(IJCER)
nternational Journal of Computational Engineering Research(IJCER)
 
Comparison of SVD & Pseudo Random Sequence based methods of Image Watermarking
Comparison of SVD & Pseudo Random Sequence based methods of Image WatermarkingComparison of SVD & Pseudo Random Sequence based methods of Image Watermarking
Comparison of SVD & Pseudo Random Sequence based methods of Image Watermarking
 
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGES
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGESEFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGES
EFFICIENT IMAGE COMPRESSION USING LAPLACIAN PYRAMIDAL FILTERS FOR EDGE IMAGES
 
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...
Performance Evaluation of Quarter Shift Dual Tree Complex Wavelet Transform B...
 
Bf36342346
Bf36342346Bf36342346
Bf36342346
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
 
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...
AUTO AI 2021 talk Real world data augmentations for autonomous driving : B Ra...
 
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
 
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
4 ijaems jun-2015-5-hybrid algorithmic approach for medical image compression...
 
Improved anti-noise attack ability of image encryption algorithm using de-noi...
Improved anti-noise attack ability of image encryption algorithm using de-noi...Improved anti-noise attack ability of image encryption algorithm using de-noi...
Improved anti-noise attack ability of image encryption algorithm using de-noi...
 
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
A Detailed Survey on VLSI Architectures for Lifting based DWT for efficient h...
 
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMSFINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
FINGERPRINTS IMAGE COMPRESSION BY WAVE ATOMS
 

Más de David Bařina

Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field CompressionDavid Bařina
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiositiesDavid Bařina
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG FormatDavid Bařina
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDavid Bařina
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformDavid Bařina
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesDavid Bařina
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000David Bařina
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceDavid Bařina
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDDavid Bařina
 
Wavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorWavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorDavid Bařina
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersDavid Bařina
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmeticDavid Bařina
 

Más de David Bařina (18)

CCSDS 122.0
CCSDS 122.0CCSDS 122.0
CCSDS 122.0
 
Lossy Light Field Compression
Lossy Light Field CompressionLossy Light Field Compression
Lossy Light Field Compression
 
Mathematical curiosities
Mathematical curiositiesMathematical curiosities
Mathematical curiosities
 
C/C++ tricks
C/C++ tricksC/C++ tricks
C/C++ tricks
 
New Transforms for JPEG Format
New Transforms for JPEG FormatNew Transforms for JPEG Format
New Transforms for JPEG Format
 
JPEG
JPEGJPEG
JPEG
 
Discrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel ArchitecturesDiscrete Wavelet Transforms on Parallel Architectures
Discrete Wavelet Transforms on Parallel Architectures
 
Parallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet TransformParallel Implementation of the 2-D Discrete Wavelet Transform
Parallel Implementation of the 2-D Discrete Wavelet Transform
 
Parallel Wavelet Schemes for Images
Parallel Wavelet Schemes for ImagesParallel Wavelet Schemes for Images
Parallel Wavelet Schemes for Images
 
Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000Single-Loop Software Architecture for JPEG 2000
Single-Loop Software Architecture for JPEG 2000
 
Wavelet News
Wavelet NewsWavelet News
Wavelet News
 
IIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkceIIR aproximace Gaussovy funkce
IIR aproximace Gaussovy funkce
 
Akcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMDAkcelerace DWT pomocí SIMD
Akcelerace DWT pomocí SIMD
 
Wavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector ProcessorWavelet Lifting on Application Specific Vector Processor
Wavelet Lifting on Application Specific Vector Processor
 
GStreamer
GStreamerGStreamer
GStreamer
 
FFmpeg
FFmpegFFmpeg
FFmpeg
 
Bit Twiddling Hacks: Integers
Bit Twiddling Hacks: IntegersBit Twiddling Hacks: Integers
Bit Twiddling Hacks: Integers
 
Fixed-point arithmetic
Fixed-point arithmeticFixed-point arithmetic
Fixed-point arithmetic
 

Último

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
An introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxAn introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxPurva Nikam
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction managementMariconPadriquez1
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxk795866
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile servicerehmti665
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniquesugginaramesh
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 

Último (20)

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
An introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxAn introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptx
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction management
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Introduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptxIntroduction-To-Agricultural-Surveillance-Rover.pptx
Introduction-To-Agricultural-Surveillance-Rover.pptx
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile serviceCall Girls Delhi {Jodhpur} 9711199012 high profile service
Call Girls Delhi {Jodhpur} 9711199012 high profile service
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniques
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 

Lifting Scheme Cores for Wavelet Transform

  • 1. Lifting Scheme Cores for Wavelet Transform David Barina (supervised by Pavel Zemcik) 1 / 24
  • 2. DWT in image processing can be found in many image-processing tasks analysis (edge detection, feature extraction, multiscale representation), compression (JPEG 2000, Dirac), watermarking, edge sharpening, contrast enhancement, tone mapping, denoising, fusion, etc. 2 / 24
  • 3. Filter bank S. Mallat, ”A theory for multiresolution signal decomposition: The wavelet representation” (1989) ˜H(z−1 ) a d ↓ 2 + ˜G(z−1 ) ↓ 2 ↑ 2 H(z) ↑ 2 G(z) decomposition: two complementary filters, high number of operations 3 / 24
  • 4. Lifting scheme I. Daubechies, W. Sweldens, ”Factoring wavelet transforms into lifting steps” (1998) a d split ˜P(z−1 )T P(z) merge P(z) = I−1 i=0 1 Si(z) 0 1 1 0 Ti(z) 1 K 0 0 1/K decomposition: sequence of simple filtering steps, reduces the number of operations, split: even, odd 4 / 24
  • 5. CDF 9/7 wavelet I. Daubechies, W. Sweldens, ”Factoring wavelet transforms into lifting steps” (1998) α β γ δ input output steps even samples odd samples ˜P(z) = 1 α 1 + z−1 0 1 1 0 β (1 + z) 1 1 γ 1 + z−1 0 1 1 0 δ (1 + z) 1 ζ 0 0 1/ζ four two-tap symmetric filters 5 / 24
  • 6. 2-D decomposition S. Mallat, ”A theory for multiresolution signal decomposition: The wavelet representation” (1989) a h v d horizontal vertical h v d a h dv image: 2-D signal, by a series of 1-D transforms, four subbands, multi-scale decomposition 6 / 24
  • 7. Lenna how to calculate this as efficiently as possible 7 / 24
  • 8. Strategies and issues R. Kutil, ”A single-loop approach to SIMD parallelization of 2-D wavelet lifting” (2006) a h v d horizontal vertical strategies row-column, block-based, and line-based cache issues cache line, limited size, set associativity, prefetching techniques padding, aggregation, memory layouts, interleave loops, parallelization the approaches have to repeatedly visit samples, memory access is expensive ⇒ CPU cache, limitations, existing techniques, single-loop approach 8 / 24
  • 9. Unsolved issues 2 × 2 prolog core epilog prolog epilog F F FF complicated border treatment (prolog/epilog phases) suspend/resume processing arbitrary processing order (scan order) interleave the transform and a subsequent processing multi-scale decomposition reorganization of underlying scheme 9 / 24
  • 10. Objectives of the thesis Aims improve image transform performance and resource consumption Objectives eliminate the shortcomings of existing methods previous slide Evaluation prove experimentally performance, memory requirements 10 / 24
  • 11. Lifting core D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press) solution: a processing unit continuously consumes an input and produces an output which visits every image sample only once (cache friendly) which is aware of image coordinates (can handle the borders) whose configuration (state) can be saved/restored which can be run in any direction which can be SIMD vectorized which can run in parallel (on independent parts of the image) y = C x x def = In B y def = On B 11 / 24
  • 12. Core examples D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press) α β γ δ m n 1 2 3 4 core inputs, outputs 12 / 24
  • 13. Processing orders D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press) horizontal horiz. strips horiz. blocks vertical vert. strips vert. blocks 13 / 24
  • 14. Borders treatment D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press) d a d a d a d a d a d a d a d a d a d d a d a d a d a d a d a d a d a d a d a n n n n n n n a d aad n nnnn d a d a d a d a d a d a d a d a d a d 0 d a d a d a d a d a d a d a d a d a d a 2 n N − 2 N 0 0 n n n n n n a y = Cn x cores gracefully treats the boundaries 14 / 24
  • 15. Parallel cores and reorganization M. Kula, D. Barina, et al., ”Block-based Approach to 2-D Wavelet Transform on GPUs” (2016) 1 2 3 4 Sweldens1995 1 2 3 Iwahashi2007 1 2 proposed 15 / 24
  • 16. 3-D core D. Barina, P. Zemcik, ”Real-Time 3-D Wavelet Lifting” (2015) x y z buffer x buffer y buffer z extended into more dimensions, buffers on the sides 16 / 24
  • 17. CPU implementation D. Barina, P. Zemcik, ”Vectorization and parallelization of 2-D wavelet lifting” (in press) 0.0 s 5.0ns 10.0ns 15.0ns 20.0ns 25.0ns 30.0ns 35.0ns 40.0ns 45.0ns 50.0ns 1.0k 10.0k 100.0k 1.0M 10.0M 100.0M time/pixel pixels separable approach core approach an evaluation of approaches, implemented the separable, single-loop, and core 17 / 24
  • 18. 3-D CPU implementation D. Barina, P. Zemcik, ”Real-Time 3-D Wavelet Lifting” (2015) x y z buffer x buffer y buffer z 0.0 s 20.0ns 40.0ns 60.0ns 80.0ns 100.0ns 120.0ns 140.0ns 160.0ns 0.0 50.0M 100.0M 150.0M 200.0M 250.0M time/voxel voxels naive horizontal naive vertical core 42 core 23 core 43 performance of 3-D transform: separable, 2-D core, 3-D core 18 / 24
  • 19. GPU implementation M. Kula, D. Barina, et al., ”Block-based Approach to 2-D Wavelet Transform on GPUs” (2016) 80.0 100.0 120.0 140.0 160.0 180.0 200.0 220.0 240.0 260.0 0.0 10.0M 20.0M 30.0M 40.0M 50.0M 60.0M 70.0M GB/s pixels Kucis2014 Separable Block Non-Separable Block 0 10 20 30 40 50 60 100kpel 1Mpel 10Mpel 100Mpel GB/s Sweldens Iwahashi* Explosive* Monolithic* Polyphase* Monolithic∗ scheme: left: SotA is in red, block methods in blue/green, reorganization right: block methods, separable in black, our in blue/green 19 / 24
  • 20. FPGA implementation D. Barina, et al., ”Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility” (2015) H V BRAM Input Transform core FF LUT BRAM latency 4 441 (0.1 %) 399 (0.18 %) 6 (1.1 %) latency 2 391 (< 0.1 %) 592 (0.27 %) 6 (1.1 %) architecture device BRAM [bits] clocks/pel time [ms] Dillen2003 VirtexE1000-8 50K 0.50 1.20 Descampe2004 Virtex-II XC2V6000 N/A 0.60 1.75 Seo2007 Altera Stratix 128K 2.64 6.02 Zhang2012 Virtex-II Pro XC2VP30 6 × 18K 0.50 0.97 the cores Zynq XC7Z045 1 × 36K 0.26 0.27 20 / 24
  • 21. JPEG 2000 implementation D. Barina, O. Klima, P. Zemcik, ”Single-Loop Architecture for JPEG 2000” (2016) core codeblock 2 × 2cn 2 × 2cm aj aj+1 h v d 0.0 20.0 40.0 60.0 80.0 100.0 120.0 140.0 100.0k 1.0M 10.0M 100.0M 1.0G time[ns] resolution [pel] proposed OpenJPEG JasPer FFmpeg 21 / 24
  • 22. Contributions of the thesis Aims improved image transform performance and resource consumption Objectives eliminated the shortcomings of existing methods Evaluation assessed experimentally (performance, memory requirements) evaluation performed: 2-D on CPU, 3-D on CPU, 2-D on GPU, 2-D on FPGA, JPEG 2000 on CPU 22 / 24
  • 23. Selected papers Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Software Architecture for JPEG 2000. In Data Compression Conference (DCC), 2016 Barina, D.; Musil, M.; Musil, P.; et al.: Single-Loop Approach to 2-D Wavelet Lifting with JPEG 2000 Compatibility. In Workshop on Applications for MultiCore Architectures (WAMCA), 2015 Barina, D.; Zemcik, P.: Minimum Memory Vectorisation of Wavelet Lifting. In Advanced Concepts for Intelligent Vision Systems (ACIVS), 2013 Barina, D.; Zemcik, P.: Wavelet Lifting on Application Specific Vector Processor. In GraphiCon, 2013 Barina, D.; Zemcik, P.: Diagonal Vectorisation of 2-D Wavelet Lifting. In IEEE International Conference on Image Processing (ICIP), 2014 Barina, D.; Zemcik, P.: Real-Time 3-D Wavelet Lifting. In International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision (WSCG), 2015 Barina, D.; Zemcik, P.: Vectorization and parallelization of 2-D wavelet lifting. Journal of Real-Time Image Processing (JRTIP), in press Barina, D.; Klima, O.; Zemcik, P.: Single-Loop Architecture for JPEG 2000. In: Image and Signal Processing (ICISP), 2016 Kula, M.; Barina, D.; Zemcik, P.: Block-based Approach to 2-D Wavelet Transform on GPUs. In International Conference on Information Technology – New Generations (ITNG), 2016 Kucis, M.; Barina, D.; Kula, M.; et al.: 2-D Discrete Wavelet Transform Using GPU. In Workshop on Application for Multi-Core Architectures (WAMCA), 2014 23 / 24
  • 24. Summary the core computing unit which processes the data in a single pass, can suspend/resume execution, can processes the data in many different orders, can handle signal boundaries (is aware of coordinates), can be easily SIMD vectorized and parallelized, and whose underlying scheme can be reorganized. 24 / 24