|
|
|
|
|
|
|
|
This project aims
at proposing a new functionality to Scalable Video Coding (SVC), which is
the support of multiple ROIs for heterogeneous display resolution. Scalable
video coding is targeted by giving temporal, spatial, and quality scalability
for the encoded bit stream. The region of interest (ROI) is an area that is
semantically important to a particular user, especially users with
heterogeneous display resolutions. The bitstream containing the ROIs could be
extracted without any transcoding operations, which may be one of the ways to
satisfy QoS.
|
|
|
Introduction |
Region of Interest (ROI) could be considered as one of the
semantic scalability in spatial dimension. Fig 1. shows ROI combined
scalability in the SVC.
|
¡¡
|
Fig. 1.
Spatial scalability with ROI |
|
Due to the restriction of resolution or display size such as mobile
environment, spatially downscaled video is provided to the use. But user may
not satisfy the small sized video, and may wants to what he/she wants to see
with enough large resolution. Defining ROI in the picture and providing only
ROI containing video stream would be a good solution to the user in that
situation. In this case, even display size is the same, semantically meaningful
region can be provided with better resolution. Most videos have ROI that is
semantically more meaningful than other regions in the picture. For example,
people in the picture are more meaningful than background. For this reason, ROI
is one of the SVC requirements. Fig 2. shows the usage of the ROI scalability.
|
Currently, MPEG and ITU-T are jointly making a standard scalable video codec
that is based on Motion Compensated Temporal Filtering (MCTF) and H.264. The
objectives of this scalable video codec are generating temporal, spatial, and
quality scalable coded stream, therefore users can be provided QoS guaranteed
streaming services independent of video consuming device in heterogeneous
network environment. But, even in the restriction of resolution or display
size, user wants to see what he/she wants to see with enough large resolution.
Defining Region of Interest (ROI) in the picture and providing only ROI
containing video stream would be good solution to the user in that situation.
Therefore, we assume the case when there exist more than one ROI in the
picture, and each ROI can be decoded independently with spatial, temporal, and
quality scalabilities. To accomplish the objective, we used FMO (Flexible
Macroblock Ordering) in the H.264 to describe ROIs. But FMO is not enough for
independent decoding of multiple ROIs. The first problem is related with
overlapping between ROIs. The second problem is related with slice group
boundary in independent decoding of ROI. In this paper, we present solutions to
address these difficulties.
|
|
1. ROI description in SVC |
(a)
(b)
|
Fig. 2. ROI description :
(a) ROI to slice group mapping (b) assignment of slice group id
|
|
2.
Independent ROI decoding |
|
FMO is not
enough for independent decoding of ROI due to the characteristics of predictive
coding and inter-pixel dependent processing. To prevent decoding dependency
between slice groups, FMO disables intra-prediction from the macroblocks
outside of a slice group. However, it only avoids the decoding dependency that
resides in the current picture; there still exists decoding dependency in
temporal direction by motion compensation. And, in the boundary of ROI, half
sample interpolation for motion estimation (ME) / compensation (MC) and
upsampling for Intra Base mode also cause a problem due to interdependency
between slice groups.
|
|
Fig.3 and Fig. 4 describe the handling of ROI boundary for
halp-pel interpolation and upsampling. |
|
|
|
|
Fig. 3. Handling ROI
boudnary in half-pel interpolation |
|
Fig. 4. ROI description
handling ROI boudnary in upsampling for Intra_Base mode |
|
|
|
Tae Meon Bae, Truong Cong Thang, Duck Yeon Kim, Yong Man Ro, and
Jae-Gon Kim, ¡°SPATIAL SCALABILITY of MULTIPLE ROIs in SCALABLE VIDEO CODING,¡±
SPIE2006(accepted)
|
|
Truong Cong Thang, Tae Meon Bae, Yong Ju Jung, Yong Man Ro,
Jae-Gon Kim, Haechul Choi, Jin-Woo Hong ,¡°Spatial Scalability of Multiple ROIs
in Surveillance Video¡± ISO/IEC JTC1/SC29/WG11 M12010, April 2005, Busan, Korea
|
|
Truong Cong Thang, Tae Meon Bae, Yong Ju Jung, Yong Man Ro, Jung
Won Kang, Haechul Choi, Jae-Gon Kim, Jin-Woo Hong, ¡°SVC CE8 report: Spatial
scalability of multiple ROIs,¡± m12321, July, 2005, Poznan, Poland
|
|
Truong Cong Thang, Tae Meon Bae, Yong Man Ro, Jung Won Kang,
Jae-Gon Kim, ¡°Improvements of Scalability Information SEI message,¡± m12635,
Oct. Nice, France
|
|
Truong Cong Thang, Tae Meon Bae, Yong Man Ro, Jung Won Kang,
Jae-Gon Kim, ¡°Boundary handling for ROI scalability,¡± m12636, Oct. Nice, France
|
|
Truong Cong Thang, Tae Meon Bae, Yong Man Ro, Jung Won Kang,
Jae-Gon Kim, ¡°Show case of ROI extraction using scalability information SEI
message,¡± m12637, Oct. Nice, France
|
|
±è´ö¿¬, ¹èŸé, ³ë¿ë¸¸, °Á¤¿ø, ±èÀç°ï,¡°½ºÄÉÀÏ·¯ºí ºñµð¿À ÄÚµù¿¡¼ ROI ÃßÃâ¿¡ °üÇÑ ¿¬±¸¡±, 2005³âµµ
Çѱ¹¸ÖƼ¹Ìµð¾îÇÐȸ Ãß°èÇмú¹ßÇ¥´ëȸ, Nov. 2005, (Accepted). |
|
¹èŸé, ±è´ö¿¬, ³ë¿ë¸¸, °Á¤¿ø, ±èÀç°ï,¡°½Ç½Ã°£ ½ºÄÉÀÏ·¯ºô·¯Æ¼ º¯È¯ SVC ºñÆ®½ºÆ®¸²¿¡ ´ëÇÑ ¿¬±¸¡±, 2005³âµµ
Çѱ¹¹æ¼Û°øÇÐȸ Çмú´ëȸ, pp.163-166, Nov. 2005.
|
|
±è´ö¿¬, ¹èŸé, ±è¿µ¼®, ³ë¿ë¸¸, ÃÖÇØö, ±èÀç°ï, ¡°QoS¸¦ À§ÇÑ ½Ç½Ã°£ °¡º¯ ½ºÄÉÀÏ·¯ºô·¯Æ¼¸¦ °¡Áø SVC
ºñÆ®½ºÆ®¸² ÃßÃâ±â¿¡ ´ëÇÑ ¿¬±¸¡±, 2005³âµµ ´ëÇÑÀüÀÚ°øÇÐȸ Ãß°èÁ¾ÇÕÇмú´ëȸ, Nov. 2005, (Accepted) |
|
±è´ö¿¬, ¹èŸé, ³ë¿ë¸¸, °Á¤¿ø, ±èÀç°ï, ¡°½ºÄÉÀÏ·¯ºí ºñµð¿À ÄÚµù¿¡¼ÀÇ ´ÙÁß ROIÀÇ ±¸Çö¡±, 2005³âµµ
Ãß°è½Åȣó¸®ÇÕµ¿Çмú´ëȸ ³í¹®Áý, p.114, Oct. 2005. |
|
|
|
|
|
|