Programme

The conference is taking place in John Britten 102 Conference Foyer. The John Britten building is at 69 Creyke Road.
The Zoom link for virtual attendees has been emailed to all authors.
Please let Richard or Joe know if you do not have access.

The format for oral presentations is:
15 minutes talk
3 minutes questions
2 minutes for changeover

Key Notes

Keynote 1: Jeff Donatelli

Multi-Tiered Iterative Projection Algorithms for Exploiting Mathematical Structure in Complex Inverse Problems

New experimental technologies have the potential to capture information from important biological objects and new materials at unprecedented detail and scales. However, the accelerating size, rate, complexity, and sensitivity of these new measurements are far outpacing the capability of traditional inversion methods to efficiently and robustly reconstruct the desired information from the data.

In this talk, I will present a new general mathematical framework capable of overcoming many of these emerging challenges in solving complex inverse problems from experimental data. This framework, called Multi-Tiered Iterative Projections (M-TIP), is based on exploiting the mathematical structure of the inverse problem via an iterative application of projection operators that directly target the specific physics of the experiment to maximize speed, accuracy, and robustness. I will demonstrate the use of M-TIP in solving important open problems in inversion for fluctuation X-ray scattering, coherent surface scattering imaging, physics-informed data reduction, and more.

Keynote 2: Julian Maclaren

NZ’s Productivity Challenge: Can Computer Vision and Machine Learning Startups Help Solve the Problem?

Over the last 50 years, New Zealand’s productivity has declined substantially relative to other OECD countries. Tech startups are a promising path to providing new exports that do not compete for land with our primary industries and can generate high paying jobs right here in NZ. This is particularly true in areas like computer vision and AI, where the potential for scalable products with global impact is immense. However, NZ’s startup ecosystem is still developing, and the idea of founding a company is simply not on the radar of most researchers here.

In this talk, I aim to demystify the startup journey, offering suggestions and possibly some inspiration for academics who would like to see their research end up in products. I’ll share my personal experiences gained by founding several computer vision and ML startups, both from overseas and from NZ, as well as from within and from outside academia. I will touch on topics such as patents and IP, forming teams, and venture capital. I hope that attending this talk will make you a little more likely to consider launching a startup and more aware of the opportunities and challenges that await those who take the leap.

Keynote 3: Andrew Lensen

AI for Good: Kākā Conservation and Milk Analysis

Computer vision holds significant promise for addressing real-world challenges, but translating this potential into tangible outcomes requires a strongly interdisciplinary approach. In this keynote, I will share early-stage findings from two of our applied AI research projects that aim to address pressing issues in conservation and milk quality assurance.

The first project, Recognising Taonga with AI, recently funded by an MBIE Smart Ideas grant, weaves together research in AI, ecology, and mātauranga Māori. It aims to develop unsupervised machine learning methods for non-invasive identification of individual kākā in urban environments. It also aligns conservation efforts with indigenous perspectives and expands ecological understanding of kākā behaviour and movements.

The second project applies machine learning to analyse the evaporation dynamics of milk droplets, offering a low-cost, interpretable method for classifying milk composition and detecting adulterants. This work is particularly relevant for improving food safety and quality assurance in resource-limited settings, with the potential for long-term applications in developing regions to enhance public health and food security.

Keynote 4: Mengjie Zhang

Evolutionary Machine Learning and Applications

Since the 1990s, evolutionary computation techniques have been widely used to solve machine learning tasks. In this talk, I will firstly provide a brief overview of machine learning and evolutionary computation, then provide a narrow view and a broad view of evolutionary machine learning. After discussing the state-of-the-art research and applications of the main paradigms of evolutionary machine learning and their success in classification, feature selection, regression, clustering, computer vision and image analysis, I will discuss evolutionary deep learning for image classification and detection with the main challenges.

Oral Presentations

(v) = virtual, (p) = in-person

Session 1: Microscopy (chair: Joe Chen)

95 (p)	End-to-end Training of Latent Space Diffusion Models for Conformational Heterogeneity in Cryo-EM Reconstruction	Zixi Hu and Kanupriya Pande
29 (v)	Disentanglement-based Unsupervised Domain Adaptation for Nuclei Instance Segmentation	Jieting Long, Dongnan Liu, Zihao Tang and Weidong Cai
71 (p)	Deep Learning Classification of Microsatellite Status in Colorectal Cancer Whole Slide Images	Anu Ahuja, Arthur Morley-Bunker and Ramakrishnan Mukundan

Session 2: Medical Imaging (chair: Phil Bones)

31 (v)	GroupLearning: Label Noise Mitigation through Multi-Expert Collaboration for Medical Image Analysis	Qinyi Cao, Dongnan Liu, Jianan Fan and Weidong Cai
34 (p)	Towards Clinically Oriented Feature Detection for Melanoma: A Deep Learning Approach	Taran Cyriac John, Qurrat Ul Ain, Harith Al-Sahaf and Mengjie Zhang
46 (p)	Medical Image Synthesis using Autoencoder with Vision Transformer	Zakia Zinat Choudhury, Brendan McCane and Sean Coffey
72 (p)	Spectral Neural Attenuation Fields For Cone Beam CT	Aaron Smith, James Atlas, Niels de Ruiter and Mars Collaboration
93 (p)	Measurement of Ocular Torsion Using Feature Matching on the Iris and Sclera	Arthur Bell and Richard Green

Session 3: Computer Vision for Agriculture (chair: Richard Clare)

40 (v)	Deep Learning for Brassica Oleracea Instance Segmentation in UAV Imagery	Jonel Macalisang
53 (v)	AI Framework for Detecting Lordosis-Kyphosis-Scoliosis in Salmon X-Rays	Loc Nguyen, Seumas Walker, Jane Symonds and Binh Nguyen
62 (v)	Genetic Programming-Based Multi-Object Matching for Mussel Floats in Mussel Farm Images	Dylon Zeng, Ying Bi, Ivy Liu, Bing Xue, Ross Vennell and Mengjie Zhang
82 (p)	Leaf or leaves? A data-centric approach looking at ground truth effect on apple disease segmentation	Masoumeh Keshavarzi, Carl Mesarich, Martin Johnson, Gourab Sen Gupta and Donald Bailey
66 (v)	MAC-VTON: Multi-modal Attention Conditioning for Virtual Try-On with Diffusion-Based Inpainting	Sanhita Pathak, Vinay Kaushik and Brejesh Lall

Session 4: Astronomical Imaging (chair: Joe Chen)

8 (v)	Wavefront Estimation from Pyramid Type Wavefront Sensors Using a Convolutional Neural Network	Kylen Patel, Le Yang and Richard Clare
14 (p)	On Feed-Forward Neural-Network-Based Atmospheric Tomography	Daniel Hopkins, Le Yang and Richard Clare
64 (v)	Demystifying Galaxy Classification : An elegant and powerful hybrid approach	Ankita Sarkar, Sarbani Palit and Ujjwal Bhattacharya

Session 5: Machine Learning for Computer Graphics (chair: Le Yang)

21 (p)	Enhancing Visual Focus: Incorporating Attention into Rule-Based Machine Learning	Abubakar Siddique, Muhammad Iqbal and Will N. Browne
33 (v)	ZS-ACL: Light-weight Zero-shot Image Denoising using alpha-Conditional Loss	Shahmir Khan Mohammed and Shakti Singh
87 (v)	Procedurally Generating Large Synthetic Worlds: Chunked Hierarchical Wave Function Collapse	Robert Christie, Brian Kitchen, Wiktor Tumilowicz, Steffan Hooper and Burkhard Wünsche
91(v)	Gaze Estimation via Synthetic Event-Driven Neural Networks	Himanshu Kumar, Naval Kishore Mehta, Sumeet Saurav and Sanjay Singh

Session 6: Object Detection and Identification (chair: Donald Bailey)

18 (v)	Contrastive Learning for Detecting Invasive Aquatic Species: An Analysis of SimCLR and Self-Supervised Methods with Unlabeled Video Data	Sadia Nasrin Tisha and Greg Hamerly
45 (p)	Foreign Object Detection in Aqueous Food Media using Surface Electric Potential and Machine Learning Techniques	Kin Wai Lee, Michael Hayes, Bill Heffernan, Phil Bones and Jaco Fourie
58 (p)	Impact of Object Detector Accuracy on Tracking-By-Detection Methods: A Case Study with Meerkats	Yuxuan Dong, Patrice Delmas and Mitchell Rogers
63 (p)	Deep Learning-Based Depth Map Generation and YOLO-Integrated Distance Estimation for Radiata Pine Branch Detection Using Drone Stereo Vision	Yida Lin, Bing Xue, Mengjie Zhang, Sam Schofield and Richard Green
75 (p)	Vote Based Line Aggregation for Sports Field Registration	Stephen Hallett, Shahrokh Heidari, Mitchell Rogers and Patrice Delmas
79 (p)	Detection of Pinus Radiata Cutpoints Using Neural Medial Axis Skeletonization	Bradley Scott, Sam Schofield and Richard Green
83 (p)	Re-Identification of Individual Kākā: An Explainable DINO-Based Model	Paula Maddigan, Oskar Ehrhardt, Andrew Lensen and Rachael Shaw
84 (v)	Top-Down Target Object Detection Through Context	Ibrahim Rahman, Christopher Hollitt, Mengjie Zhang, Osama Rehman, Aisha Ajmal and Simon Jigwan Park

Session 7: Motion Tracking Control and Analysis (chair: Michael Hayes)

17 (p)	Evaluating your Re-ID Method for Robustness: Consistency, Domain-Shift and Corruption	Jerome Williams, Ben Yip, Tristrom Cooke and Connor Luckett
52 (p)	Robust 360° Visual Tracking with Dynamic Gnomonic Projection	Hao Peng and Fanglue Zhang
59 (p)	Deep Learning-Based Buoyancy Prediction for Intelligent Mussel Farm Monitoring	Carl McMillan, Junhong Zhao, Bing Xue, Ross Vennell and Mengjie Zhang
70 (p)	Causal Behaviour Modelling for Player Action Prediction in Rugby League Using Graph Neural Networks	Jiaxuan Wang, Ruigeng Wang, Gael Gendron, Shahrokh Heidari, David Arturo Soriano Valdez, Gibran Alfonso Zazueta Cruz, Stephen Hallett, Mitchell Rogers, Alexander Woodward, Alfonso Gastelum Strozzi, Riki Mitchell, Mengjie Zhang, Michael Witbrock and Patrice Delmas
73 (v)	AIM-CP: AI-based Multi-Visual Components Analysis for Crowd Dynamics Prediction	Elham Basunduwah, Muhammad Mohzary, Sejun Song and Baekyoung Choi

Session 8: Augmented Reality (chair: Rick Millane)

22 (p)	Accurate 3D Grapevine Structure Extraction from High-Resolution Point Clouds	Harry Dobbs, Casey Peat, Oliver Batchelor, James Atlas and Richard Green
69 (p)	Investigating Feature Clustering for Generalised 3D Point Cloud Part Segmentation	Nirmal Das, Steven Mills, Lech Szymanski, Xinyu Jiang and Tapabrata Chakraborti
67 (p)	Video Quality Metric Compatible With Psnr Considering Recent Knowledge of Peripheral Characteristics of Human Vision	Anastasia Mozhaeva, Igor Vlasuyk, Aleksei Potashnikov, Vladimir Mazin and Lee Streeter
30 (p)	Evaluating Accuracy and Efficiency of Fruit Image Generation Using Generative AI Diffusion Models for Agricultural Robotics	Kun Zhao, Minh Nguyen and Weiqi Yan

Poster Presentations

7 (p)	Design and Prototype of a High Order Adaptive Optics System for University of Canterbury’s 0.61 m Boller & Chivens Telescope	Emma Johnson, Richard Clare, Joe Chen and Stephen Weddell
11 (p)	Two-stage punch-code recognition using a CNN and the Hough transform	Kapila Pahalawatta and Jaco Fourie
15 (v)	Addressing Retail Object Detection Challenges through Dual Network Integration	Caleb Stilp and Cheol-Hong Min
16 (p)	A Frequency-Domain Approach for Detecting Overhead Conductors in Aerial Images	Zhicheng Pan, David Wilson, Martin Stommel, Alejandro Castellanos and Ben van Vliet
25 (p)	Image Segmentation with a Deep Declarative Network	Lech Szymanski, Steven Mills, Garth Wales and Hayden McAlister
26 (v)	Which Region Proposal to Choose? A Case Study for Automatic Identification of Retail Products	Bikash Santra and Dipti Prasad Mukherjee
47 (p)	Classification of Bloodstain Spatter Patterns with Convolutional Neural Networks	Rosalyn Rough, Swapnil Bhagat, Andrew Bainbridge-Smith, Oliver Batchelor and Richard Green
56 (p)	Automatic Quantification of King Salmon Farm Net Occlusion via Biofouling Detection	Qinglan Fan, Ying Bi, Bing Xue, Jane Symonds, Lauren M. Fletcher and Mengjie Zhang
68 (p)	Interactive Engagement with Place and Culture	Steven Mills, Charlie Templeton and Deborah Goomes
74 (p)	Automated Flow Tracking Software for Capillary Microfluidic Devices	Rhys Marchant-Ludlow, Ciaran Moore and Volker Nock
77 (v)	Modelling methane emissions from rice paddies using machine learning	Abira Sengupta and Nuzla Ismail
80 (v)	Automatic Detection of Peripheral Facial Landmarks Using 3D Facial Data	Mehraan Chowdhury, Mohammed Bennamoun, Farid Boussaid and Syed Mohammed Shamsul Islam
85 (v)	Design and Validation of Procedurally Generated Personalised Spatial Reasoning Tests	Owen Xu, Tim Li, Nasser Giacaman, Zixuan Wang and Burkhard Wuensche
88 (v)	Spatial Mini Golf: Game-Based Spatial Intelligence Testing and Training Using VR and Non-VR Platforms	Patrick Oliver, Andy Fong, Zixuan Wang and Burkhard Wuensche
90 (p)	Wavefront Estimation from a Shack-Hartmann Wavefront Sensor Using a Convolutional Neural Network	Kylen Patel, Le Yang and Richard Clare

The 39^th Conference on
Image and Vision Computing New Zealand

4-6 December 2024
Christchurch, New Zealand and Online

Key Notes

Oral Presentations

Poster Presentations