메뉴 건너뛰기

추천
검색
질문

논문 기본 정보

자료유형
학술저널
저자정보
(Korea Institute of Industrial Technology) (Korea Maritime and Ocean University) (Dankook University) (Korea Maritime and Ocean University) (Korea Maritime and Ocean University)
저널정보
제어로봇시스템학회 제어로봇시스템학회 논문지 제어로봇시스템학회 논문지 제29권 제12호
발행연도
수록면
1,015 - 1,023 (9page)
DOI
10.5302/J.ICROS.2023.23.0152

이용수

표지
📌
연구주제
📖
연구배경
🔬
연구방법
이 논문의 연구방법이 궁금하신가요?
🏆
연구결과
이 논문의 연구결과가 궁금하신가요?
AI에게 요청하기
추천
검색
질문

초록· 키워드

Monocular depth estimation (MDE) is a computer vision task that estimates the depth value for each pixel in a single monocular RGB image. Given its inherently ill-posed nature, recent studies have focused on data-driven learning-based methods, particularly deep learning techniques based on convolutional neural networks. Building on the success of vision transformers (ViTs), ViT-based models have gained widespread attention for MDE applications. To enhance the performance of MDE, we introduce an intermediate bi-directional long-short term memory (BiLSTM) module, a common component in natural language processing, to learn correlations between each token in a sequence both in the forward and backward directions. Given that transformers naturally encode sequential data, leveraging BiLSTMs to learn the correlations between each token in both directions is a reasonable approach. In addition, we enhance the weight initialization method for BiLSTMs by employing the CLS tokens of transformers, which was originally developed for natural language processing, to make it applicable for processing two-dimensional image data. Subsequently, we developed an MDE model by inserting a BiLSTM module initialized using our method between the encoder and decoder of a DPTbased architecture. Our experiments reveal notable results, with our model achieving 0.913 in 𝛿₁ and 0.097 in AbsRel for the NYU Depth V2 dataset. These results demonstrate a significant improvement over the previous models.
상세정보 수정요청해당 페이지 내 제목·저자·목차·페이지
정보가 잘못된 경우 알려주세요!

목차

  1. Abstract
  2. Ⅰ. INTRODUCTION
  3. Ⅱ. RELATED WORKS
  4. Ⅲ. PROPOSED ARCHITECTURE
  5. Ⅳ. EXPERIMENTS
  6. Ⅴ. CONCLUSION
  7. REFERENCES

참고문헌

참고문헌 신청

최근 본 자료

전체보기