Home

Awesome

<!-- [![HitCount](https://hits.dwyl.com/arclab-hku/Event_based_VO-VIO-SLAM.svg?style=flat-square)](http://hits.dwyl.com/arclab-hku/Event_based_VO-VIO-SLAM) --> <div align="center">

Event-based Vision for VO/VIO/SLAM in Robotics

Author: Guan Weipeng, Chen Peiyu

</div>

This is the repositorie that collects the dataset we used in our papers. We also conclude our works in the field of event-based vision. We hope that we can make some contributions for the development of event-based vision in robotics.

If you have any suggestions or questions, do not hesitate to propose an issue.

If you find this repositorie is helpful in your research, a simple star or citation of our works should be the best affirmation for us. :blush:

Dataset for Stereo EVIO

This dataset contains stereo event data at 60HZ and stereo image frames at 30Hz with resolution in 346 × 260, as well as IMU data at 1000Hz. Timestamps between all sensors are synchronized in hardware. We also provide ground truth poses from a motion capture system VICON at 50Hz during the beginning and end of each sequence, which can be used for trajectory evaluation. To alleviate disturbance from the motion capture system’s infrared light on the event camera, we add an infrared filter on the lens surface of the DAVIS346 camera. Note that this might cause the degradation of perception for both the event and image camera during the evaluation, but it can also further increase the challenge of our dataset for the only image-based method.

This is a very challenge dataset for event-based VO/VIO, features aggressive motion and HDR scenarios. EVO, ESVO, Ultimate SLAM are failed in most of the sequences. We think that parameter tuning is infeasible, therefore, we suggest the users use same set of parameters during the evaluation. We hope that our dataset can help to push the boundary of future research on event-based VO/VIO algorithms, especially the ones that are really useful and can be applied in practice.

Acquisition Platform

<div align="center"> <a target="_blank"><img src="ESVIO/quadrotor_flight.jpg" alt="image" width="80%" /></a> <p> The Platform for Data Collection </p> </div>

Driver Installation

We thanks the rpg_dvs_ros for intructions of event camera driver.

<!-- We modified the source code of the [rpg_dvs_ros](https://github.com/uzh-rpg/rpg_dvs_ros) with consistent image size. -->

We add the function of the hardware synchronized for stereo setup, the source code is available in link. After installing the driver, the user can directly run the following command to run your stereo event camera:

roslaunch stereo_davis_open.launch

Tips: Users need to adjust the lens of the camera, such as the focal length, aperture. Filters are needed for avoiding the interfere from infrared light under the motion capture system. For the dvxplorer, the sensitive of event generation should be set, e.g. bias_sensitivity. Users can visualize the event streams to see whether it is similiar to the edge map of the testing environments, and then fine-tune it. Otherwise, the event sensor would output many noise and ultimately leading the event data as useless as the M2DGR datasets.

Data Sequence

In our VICON room:

<div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesOne DriveBaidu Disk
hku_agg_translation2022-103.63g---aggressiveRosbagRosbag
hku_agg_rotation2022-103.70g---aggressiveRosbagRosbag
hku_agg_flip2022-103.71g---aggressiveRosbagRosbag
hku_agg_walk2022-104.52g---aggressiveRosbagRosbag
hku_hdr_circle2022-102.91g---hdrRosbagRosbag
hku_hdr_slow2022-104.61g---hdrRosbagRosbag
hku_hdr_tran_rota2022-103.37g---aggressive & hdrRosbagRosbag
hku_hdr_agg2022-104.43g---aggressive & hdrRosbagRosbag
hku_dark_normal2022-104.24g---dark & hdrRosbagRosbag
</div>

Outdoor large-scale (outdoor without ground truth):

The path length of this data sequence is about 1866m, which covers the place around 310m in length, 170m in width, and 55m in height changes, from Loke Yew Hall to the Eliot Hall and back to the Loke Yew Hall in HKU campus. That would be a nice travel for your visiting the HKU :heart_eyes: Try it!

<div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesRosbag
hku_outdoor_large-scale2022-1167.4g34.9minutesIndoor+outdoor; large-scaleRosbag
</div>

Dataset for Monocular EVIO

You can use these data sequence to test your monocular EVIO in different resolution event cameras. TheDAVIS346 (346x260) and DVXplorer (640x480)are attached together (shown in Figure) for facilitating comparison. All the sequences are recorded in HDR scenarios with very low illumination or strong illumination changes through switching the strobe flash on and off. We also provide indoor and outdoor large-scale data sequence.

Acquisition Platform

<div align="center"> <a target="_blank"><img src="IROS2022/sensor_setup.png" alt="image" width="100%" /></a> <p> The Platform for Data Collection </p> </div>

Data Sequence

With VICON as ground truth:

<div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesOne DriveBaidu Disk
vicon_aggressive_hdr2021-1223.0g---HDR, Aggressive MotionRosbagRosbag
vicon_dark12021-1210.5g---HDRRosbagRosbag
vicon_dark22021-1216.6g---HDRRosbagRosbag
vicon_darktolight12021-1217.2g---HDRRosbagRosbag
vicon_darktolight22021-1214.4g---HDRRosbagRosbag
vicon_hdr12021-1213.7g---HDRRosbagRosbag
vicon_hdr22021-1216.9g---HDRRosbagRosbag
vicon_hdr32021-1211.0g---HDRRosbagRosbag
vicon_hdr42021-1219.6g---HDRRosbagRosbag
vicon_lighttodark12021-1217.0g---HDRRosbagRosbag
vicon_lighttodark22021-1212.0g---HDRRosbagRosbag
</div>

indoor (no ground truth):

<div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesRosbag (Baidu Disk)
indoor_aggressive_hdr_12021-1216.62g---HDR, Aggressive MotionRosbag
indoor_aggressive_hdr_22021-1215.66g---HDR, Aggressive MotionRosbag
indoor_aggressive_test_12021-1217.94g---Aggressive MotionRosbag
indoor_aggressive_test_22021-128.385g---Aggressive MotionRosbag
indoor_12021-123.45g------Rosbag
indoor_22021-125.31g------Rosbag
indoor_32021-125.28g------Rosbag
indoor_42021-126.72g------Rosbag
indoor_52021-1213.79g------Rosbag
indoor_62021-1220.39g------Rosbag
</div>

Outdoor (no ground truth):

<div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesRosbag (Baidu Disk)
indoor_outdoor_12021-1220.87g---******Rosbag
indoor_outdoor_22021-1239.5g---******Rosbag
outdoor_12021-125.52g---******Rosbag
outdoor_22021-125.27g---******Rosbag
outdoor_32021-126.83g---******Rosbag
outdoor_42021-127.28g---******Rosbag
outdoor_52021-127.26g---******Rosbag
outdoor_62021-125.38g---******Rosbag
outdoor_round12021-1211.27g---******Rosbag
outdoor_round22021-1213.34g---******Rosbag
outdoor_round32021-1237.26g---******Rosbag
</div>

On quadrotor platform (sample sequence in our PL-EVIO work):

We also provide the data squences that are collected in the flighting quadrotor platform using DAVIS346.

<div align="center"> <a target="_blank"><img src="PL-EVIO/sensor_setup.jpg" alt="image" width="100%" /></a> <p> The Platform for Data Collection </p> </div> <div align="center">
Sequence NameCollection DateTotal SizeDurationFeaturesRosbag
Vicon_dvs_fix_eight2022-081.08g---quadrotor flightingRosbag
Vicon_dvs_varing_eight2022-081.48g---quadrotor flightingRosbag
outdoor_large_scale12022-089.38g16 minutes******Rosbag
outdoor_large_scale22022-089.34g16 minutes******Rosbag
</div>

Modified Public Dataset

Modified VECtor Dataset

VECtor dataset covering the full spectrum of motion dynamics, environment complexities, and illumination conditions for both small and large-scale scenarios. We modified the frequency of the event_left and event_right (60Hz) and the message format from "prophesee_event_msgs/EventArray" to "dvs_msgs/EventArray" in the VECtor dataset, so that there is more event information in each frame and we can extract effective point and line features from the event stream. We release this modified VECtor Dataset to facilitate research on event camera. For the convenience of the user, we also fuse the individual rosbag from different sensors together (left_camera, right_camera, left_event, right_event, imu, groundtruth).

<div align="center"> <a target="_blank"><img src="Others/overview of Vector.png" alt="image" width="100%" /></a> <p> Overview of Vector dataset </p> </div> <div align="center">
Sequence NameTotal SizeOne DriveBaidu Disk
board-slow3.18gRosbagRosbag
corner-slow3.51gRosbagRosbag
robot-normal3.39gRosbagRosbag
robot-fast4.23gRosbagRosbag
desk-normal8.82gRosbagRosbag
desk-fast10.9gRosbagRosbag
sofa-normal10.8gRosbagRosbag
sofa-fast6.7gRosbagRosbag
mountain-normal10.9gRosbagRosbag
mountain-fast16.6gRosbagRosbag
hdr-normal7.73gRosbagRosbag
hdr-fast13.1gRosbagRosbag
corridors-dolly7.78gRosbagRosbag
corridors-walk8.56gRosbagRosbag
school-dolly12.0gRosbagRosbag
school-scooter5.91gRosbagRosbag
units-dolly18.5gRosbagRosbag
units-scooter11.6gRosbagRosbag
</div>

Modified DSEC Dataset

DSEC is a stereo camera dataset in driving scenarios that contains data from two monochrome event cameras and two global shutter color cameras in favorable and challenging illumination conditions. In addition, it also collects Lidar data, IMU and RTK GPS measurements. However, the data sequence of different sensors in DSEC are divided and in different data formats, which is very unfriendly to users. Therefore, we convert them into same rosbag which might be easier for event-based VIO evaluation. The code of processing the data can be also available in here.

<div align="center"> <a target="_blank"><img src="Others/overview of DSEC.gif" alt="image" width="100%" /></a> <p> Overview of DSEC dataset </p> </div> <div align="center">
Sequence NameTotal SizeOne Drive
zurich city 04 (a)13.8gRosbag
zurich city 04 (b)5.33gRosbag
zurich city 04 (c)18.7gRosbag
zurich city 04 (d)15.5gRosbag
zurich city 04 (e)4.94gRosbag
zurich city 04 (f)15.1gRosbag
</div>

Our Works in Event-based Vision

1. Mono-EIO

This work proposed event inertial odometry (EIO). We do not rely on the use of image-based corner detection but design a asynchronously detected and uniformly distributed event-cornerdetector from events-only data. The event-corner features tracker are then integrated into a sliding windows graph-based optimization framework that tightly fuses the event-corner features with IMU measurement to estimate the 6-DoF ego-motion.

<!-- * Code is available in [internal-accessed link](https://github.com/arclab-hku/EVIO/tree/evio_mono_noetic) --> <div align="center"> <a href="https://www.bilibili.com/video/BV1Dg411y7ds/?spm_id_from=333.999.0.0" target="_blank"><img src="IROS2022/cover.jpg" alt="video" width="100%" /></a> <p> Demo Video (click the image to open video demo) </p> </div>
@inproceedings{GWPHKU:Mono-EIO,
  title={Monocular Event Visual Inertial Odometry based on Event-corner using Sliding Windows Graph-based Optimization},
  author={Guan, Weipeng and Lu, Peng},
  booktitle={2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)},
  pages={2438-2445},
  year={2022},
  organization={IEEE}
}

2. PL-EVIO

This work proposed the event-based visual-inertial odometry (EVIO) framework with point and line features, including: pruely event (PL-EIO) and event+image (PL-EVIO). It is reliable and accurate enough to provide onboard pose feedback control for the quadrotor to achieve aggressive motion, e.g. flipping.

<!-- * Code is available in [internal-accessed link](https://github.com/arclab-hku/EVIO/tree/PL-EIO) --> <div align="center"> <a href="https://www.bilibili.com/video/BV1c14y1j75L/?spm_id_from=333.999.0.0&vd_source=a88e426798937812a8ffc1a9be5a3cb7" target="_blank"><img src="PL-EVIO/cover.jpg" alt="video" width="100%" /></a> <p> Demo Video (click the image to open video demo) </p> </div> <div align="center"> <a href="https://www.bilibili.com/video/BV1i24y1R7KV/?spm_id_from=333.999.list.card_archive.click&vd_source=a88e426798937812a8ffc1a9be5a3cb7" target="_blank"><img src="PL-EVIO/PLEVIO_flip_3.gif" alt="video" width="100%" /></a> <p> Onboard Quadrotor Flip using Our PL-EVIO (click the gif to open video demo)</p> </div> <!-- <div align="center"> <a href="https://www.bilibili.com/video/BV1i24y1R7KV/?spm_id_from=333.999.list.card_archive.click&vd_source=a88e426798937812a8ffc1a9be5a3cb7" target="_blank"><img src="PL-EVIO/flip.jpg" alt="video" width="100%" /></a> <p> Onboard Quadrotor Flip using Our PL-EVIO (click the image to open) </p> </div> -->
@article{GWPHKU:PL-EVIO,
  title={PL-EVIO: Robust Monocular Event-based Visual Inertial Odometry with Point and Line Features},
  author={Guan, Weipeng and Chen, Peiyu and Xie, Yuhan and Lu, Peng},
  journal={IEEE Transactions on Automation Science and Engineering},
  year={2023}
}

3. ESVIO

This work proposed the first stereo event-based visual inertial odometry framework, including ESIO (purely event-based) and ESVIO (event with image-aided). The stereo event-corner features are temporally and spatially associated through an event-based representation with spatio-temporal and exponential decay kernel. The stereo event tracker are then tightly coupled into a sliding windows graph-based optimization framework for the estimation of ego-motion.

<!-- * Code is available in [internal-accessed link](https://github.com/arclab-hku/ESVIO) --> <div align="center"> <a href="https://www.bilibili.com/video/BV1ve4y1M7v4/?share_source=copy_web&vd_source=a722388e07ea53f32d00aed0a0117f3c" target="_blank"><img src="ESVIO/ESVIO_hdr_flight_gif.gif" alt="video" width="100%" /></a> <p> Onboard Quadrotor Flight using Our ESVIO as State Estimator (click the gif to open video demo)</p> </div>
@article{GWPHKU:ESVIO,
  title={ESVIO: Event-based Stereo Visual Inertial Odometry},
  author={Chen, Peiyu and Guan, Weipeng and Lu, Peng},
  journal={IEEE Robotics and Automation Letters},
  year={2023},
  volume={8},
  number={6},
  pages={3661-3668},
  publisher={IEEE}
}

4. ECMD

ECMD is an event-based dataset for autonomous driving. It provides data from two sets of stereo event cameras with different resolutions (640x480, 346x260), stereo industrial cameras, an infrared camera, a top-installed mechanical LiDAR with two slanted LiDARs, two consumer-level GNSS receivers, and an onboard IMU. Meanwhile, the ground-truth of the vehicle was obtained using a centimeter-level high-accuracy GNSS-RTK/INS navigation system.

<div align="center"> <a href="https://www.bilibili.com/video/BV1pN411s79g/?spm_id_from=333.337.search-card.all.click&vd_source=c4be0359ec60c90d434f634ab4075470" target="_blank"><img src="ECMD/homepage_vis.gif" alt="video" width="100%" /></a> <p> Overview of ECMD (click the gif to open video demo)</p> </div>
@article{GWPHKU:ECMD,
  title={ECMD: An Event-Centric Multisensory Driving Dataset for SLAM},
  author={Chen, Peiyu and Guan, Weipeng and Huang, Feng and Zhong, Yihan and Wen, Weisong and Hsu, Li-Ta and Lu, Peng},
  journal={IEEE Transactions on Intelligent Vehicles},
  year={2023}
}

5. EVI-SAM

EVI-SAM is a full event-based SLAM system that tackle the problem of 6-DoF pose tracking and 3D dense mapping using the monocular event camera. To the best of our knowledge, this is the first framework that employs a non-learning approach to achieve event-based dense and textured 3D reconstruction without GPU acceleration. Additionally, it is also the first hybrid approach that integrates both direct-based and feature-based methods within an event-based framework.

<div align="center"> <a href="https://www.bilibili.com/video/BV19w411b7te/?spm_id_from=333.999.top_right_bar_window_history.content.click&vd_source=a88e426798937812a8ffc1a9be5a3cb7" target="_blank"><img src="EVI-SAM/first_image.png" alt="video" width="100%" /></a> <p> Demo Video (click the image to open video demo) </p> </div>
@article{GWPHKU:EVI-SAM,
  title={EVI-SAM: Robust, Real-Time, Tightly-Coupled Event--Visual--Inertial State Estimation and 3D Dense Mapping},
  author={Guan, Weipeng and Chen, Peiyu and Zhao, Huibin and Wang, Yu and Lu, Peng},
  journal={Advanced Intelligent Systems},
  pages={2400243},
  year={2024},
  publisher={Wiley Online Library}
}

6. DEIO

Learning-based SLAM has long been highly regarded, yet its generalization capabilities remain in question, this work takes learning-based VIO to a new level. We design the learning-optimization-combined framework that tightly-coupled integrate trainable event-based differentiable bundle adjustment (e-DBA) with IMU pre-integration in a patch-based co-visibility factor graph that employs keyframe-based sliding window optimization. The framework is also designed to be easily plug-and-play, with DEIO for event-IMU modalities and DVIO† for image-IMU modalities.

<div align="center"> <a href="https://kwanwaipang.github.io/DEIO/" target="_blank"><img src="https://github.com/arclab-hku/DEIO/blob/main/Figs/cover_figure.png" alt="video" width="100%" /></a> <p> Framework Overview (click the image to open the project website) </p> </div>
@article{GWPHKU:DEIO,
  title={DEIO: Deep Event Inertial Odometry},
  author={Guan, Weipeng and Lin, Fuling and Chen, Peiyu and Lu, Peng},
  journal={arXiv preprint arXiv:2411.03928},
  year={2024}
}

Using Our Methods as Comparison

We strongly recommend the peers to evaluate their proposed method using our dataset, and do the comparison with the raw results from our methods using their own accuracy criterion.

The raw results/trajectories of our methods can be obtained in :point_right: here.

Recommendation

<!-- * Useful tools: - https://github.com/TimoStoff/event_utils - https://github.com/tub-rip/events_viz -->

LICENSE

This repositorie is licensed under MIT license. International License and is provided for academic purpose. If you are interested in our project for commercial purposes, please contact Dr. Peng LU for further communication.