Reflections on a career in AI and machine learning

Delaram

Delaram Behnami

Job: Senior Machine Learning Scientist, Getty Images

Delaram Behnami is a values-driven professional working at the leading edge of machine learning and AI. In her current position as a Senior Machine Learning Scientist at Getty Images, she is working at the intersection of supporting ethically generated AI imagery and ensuring content on the Getty Images platform is authentic to the real world. 

 

You’ve been working in AI since before it was mainstream. What changes have you seen?

We have come a long way since I began as a starry-eyed graduate student in this field over a decade ago. I started out very excited about the opportunities offered by AI, particularly for biomedical and health applications, which was the focus of my three degrees at UBC. Over the past decade, the rise of AI has accelerated dramatically. Not long ago, organizations had to build or purchase most of their tools and software from scratch. Computations were expensive and limited, and study sizes were small. But then, as compute and storage costs continued to drop and became much more affordable – enabling models to digest more data, unlocking deeper learning from training sets and subsequently more powerful models – we started to shift towards open-source solutions. Computer science and applied scientific papers proliferated and more companies emerged to build tools that abstracted away many of the building blocks needed. Quickly, academic papers and proofs of concept became products, some products started selling and generating revenue, which in turn brought in more cash and investments, and it all built up a lot of momentum very quickly, leading us to where we are today, where many people would consider an industrial revolution, with all its fun and not-so-fun socio-economic impacts!

Most recently, with generative AI, we are seeing – in real time – rapid and vast AI integration in various workflows and industries. 

In software engineering, AI assistants are disrupting how programmers develop code – not just to get syntax right, but to build entire applications, albeit imperfectly, based on natural language requirements and descriptions. 

In the creative world, we are witnessing an explosion of AI imagery and tools, thanks to advancements in AI models, cheaper computing power, and an unprecedented abundance of visual data. All these models and applications are evolving, and they’re evolving fast.

Even just a few years ago, AI-generated images did not fool anyone – they looked fake, and sometimes creepy! We have all seen images like those of a cute cat in a human outfit drinking bubble tea in space, as well as those of people with wonky eyes and extra fingers, and that general sense of uneasiness about them! Recent models, however, are increasingly faster and cheaper, and can produce higher-quality, more realistic photograph-like images. 

AI images flood the web at a daily rate of tens of millions, competing with real-world images. 

As model quality and believability are improving, it is becoming progressively harder for humans to distinguish a real image from an AI-generated one, with some recent studies suggesting that humans’ success rate might be as low as 62%. In the context of creative stock content, generated imagery and video undermine artists in two ways: first, by training on their data without compensating them, and second, by unfairly competing with their original work on marketplaces. In editorial photography, on the other hand, the stakes of generative AI are especially high. Images capturing conflict, political figures, influential personalities, important locations or events, or natural disasters carry immense weight. Introducing fabricated visuals in these genres doesn’t just mislead—it can actively distort reality and fuel the spread of disinformation. This has sparked a broader dialogue among policymakers, companies, creators, and other stakeholders on how to develop these tools in a responsible manner that amplifies creativity while ensuring a robust, sustainable and authentic creative ecosystem for the future. 

 

Tell us about your current position at Getty Images. 

I’m a Senior Machine Learning Scientist at Getty Images. Getty Images is a global visual media company and supplier of millions of creative stock and editorial photos and videos. It is a marketplace where businesses or individuals can browse, search and purchase content created by creators from around the world. Creators can submit their original work that respects specified legal and technical guidelines and be featured on Getty Images, where their work, which they own the copyright to, can be legally licensed for various needs, such as advertising campaigns.

My team (Artificial Intelligence, Machine Learning) and I work on the two sides of the generative AI coin: 

  • ensuring our customers are confident that imagery on our platform is real and authentic
  • providing customers with commercially safe ways to generate AI content through our own tools and services. 

Our Generative AI by Getty Images model was trained on licensed creative imagery, so none of our editorial content was included in the training set. Furthermore, it compensates creators whose original work has been used to train our model on an ongoing basis - not a one-time fee.

Our main objectives with this work are 1) to enable our customers to explore the power of generative AI without creating commercial risk and 2) to ensure this opens additional revenue streams for our contributor base.

Getty Images

 

What do you like most about your job?

I really enjoy applying scientific tools and tricks to solve real-world problems. I love collaborating with interdisciplinary teams made up of engineers, domain experts, scientists, academics and business professionals. I enjoy experimenting with the new tech gadgets and occasionally nerd out, although considering the speed of it all, it can be overwhelming! 

I enjoy my current job particularly because I find it highly relevant to our world’s evolving challenges, and to the imperative to present real events truthfully, and protect fair creative expression and content generation in the age of AI. I appreciate that our field has the potential to make a positive impact on the world – if we can be responsible with it.

 

Tell us about your academic journey.

I actually come from the world of biomedical AI research. I would describe my academic research field as the intersection of machine learning, computer vision, and medical imaging – or you could say, AI for disease screening and diagnostic assistance, and imaging guidance. 

The initial spark of interest in machine learning was ignited in my final-year Electrical and Computer Engineering (ECE) undergraduate Capstone project, under the supervision of Dr. Purang Abolmaesumi with Change Healthcare (formerly McKesson Imaging), a medical imaging company with a Canadian office in Richmond, BC. 

Capstone  Dr. Purang Abolmaesumi

We developed an AI tool to determine patient orientation in chest X-rays, to ensure the correct side of the body is selected for surgery planning. Believe it or not, there are actual records of surgeons cutting into the wrong side of a patient due to misreading an X-ray! Our solution resulted in a joint UBC-industry patent in 2016, which marked my introduction to this field.

Having developed a taste for this space, I pursued a master’s degree under the guidance of Dr. Abolmaesumi and Dr. Robert Rohling. My research involved utilizing statistical shape and pose AI models to align pre-operative CT (computed tomography) and MR (magnetic resonance) images of the lumbar spine to intra-operative ultrasound images. Fusing different imaging modalities with AI models can enhance visibility for clinicians to assist and guide epidurals and facet joint needle insertions.

At this point, I was fully invested in AI in medical images, not to mention the overall grad school atmosphere, including our lab (Robotics and Control Lab (RCL)) with all its ultrasound machines, phantoms, and surgical robots casually around, international conferences, and the whole thing! 

I was determined to pursue a PhD next, and deep learning was all the hype! I was fortunate to do my PhD research on applications of machine learning in heart disease. 

RCL

My PhD research meant a lot to me. My dissertation involved developing a machine learning framework to diagnose heart disease in echocardiography (aka echo, i.e., real-time ultrasound image streams of the heart). Echo is an extremely complex modality to capture and read, and the heart is a highly complex dynamic organ with numerous complications and nuances, so making sense of echo requires extensive training. If we can use AI to assist clinicians, particularly those without extensive training or those in rural areas, in capturing and understanding echo images, we may be saving lives. Our team, led by Dr. Abolmaesumi and Dr. Teresa Tsang worked closely with Vancouver Coastal Health (VCH). The project then expanded to all British Columbian regional health authorities in partnership with Providence Health Care and industry, funded by Canada Digital Supercluster. Our machine learning and clinical teams have since published dozens of peer-reviewed papers and received several patents.

During my senior years of PhD, in those early pandemic years, I also started working in industry. I worked as an intern and then as a Machine Learning Applied Scientist at Amazon (Special Projects), on a client project in the health-care space. As part of an interdisciplinary team of scientists, I contributed to the development of solutions that automate image acquisition, quantification and characterization of lab study results using computer vision and machine learning.

Seeking to engage more deeply in academic research, I took on the role of Research Manager at the UBC Department of Medicine over the last year-and-a-half of my PhD. My role involved Department- and Faculty-level strategic planning to bring data science to health and medicine in the academic setting, as well as research support. I worked closely with the founders of the Data Science in Health (DASH) Research Cluster, Department Head Dr. Anita Palepu, Associate Head of Research Dr. Tsang, and the Faculty of Medicine on initiatives to enable deeper and broader applications of data science in health research. These initiatives included enhancing literacy and access for researchers in health to data science tools and knowledge, inventorying health data assets, and grant development to support and sustain such activities. Additionally, in this role, with Drs. Tsang, Abolmaesumi, and Anna Meredith, we developed a research grant proposal for the prestigious CFI John R. Evans Leaders Fund (JELF) to support the technical infrastructure required for further research in machine learning applications for the detection and management of heart failure. 

This grant, approved in 2023, supports machine learning academic research aimed at improving provincial health outcomes related to heart disease.

Besides research, as any good (and economically aware) graduate student, I also TA’d for a few courses throughout the years (one for seven times!), organized and taught some technical and non-technical workshops, and tutored high school students in math and physics in my earlier years. In 2019-2020, with professors Dr. Matt Yedlin and Dr. Bhushan Gopaluni and my peers, Drs. Mohammad Jafari, Fatemeh Dezaki, and Lee Rippon, we developed a pilot machine learning course, which continues to be offered to graduate students in the Department of ECE.

 

Where did you work after graduating?

After completing my PhD, I decided to return to industry, so I took a position at VideaHealth, a Boston-based startup that uses AI to enhance diagnostics and improve clinical workflows in dentistry, based on dental imaging. As a Machine Learning Engineer, my team and I developed AI models to detect dental pathologies, previous treatments and relevant anatomies in dental radiographs (X-rays). This work ultimately led to the largest dental device FDA clearance at the time for VideaAssist, a computer-assisted device that analyzes intraoral X-rays and detects various dental findings. 

VideaAssist currently serves 40,000 clinicians daily, processing over 500 million X-rays annually! 

VideaHealth

As and Large Language Models (LLMs) gained more momentum, I became more curious about generative AI and decided to transition to an industry vertical with lower data privacy and sensitivity risks than medical and health data. In 2023, I joined Lily AI, a B2B AI startup based in California, specializing in the e-commerce and retail space. Lily’s offerings focus on enhancing product discoverability and personalization using machine learning, computer vision and natural language processing. Lily’s offerings include product attribute enrichment – for example, tagging products based on images and text to improve search relevance, and enable filters and facets. My work at Lily involved building, evaluating and optimizing multimodal AI. Our scope included fashion, home, and beauty, with customers including retail giants like Bloomingdale’s, Home Depot and Sephora.

And that brings us to today! I currently work at Getty Images, and also have a part-time engagement with the Golden Gate University (San Francisco) as a Research Thesis Supervisor, where I advise a handful of seasoned professionals enrolled in the Emerging Technologies Doctorate of Business Administration (DBA) in Generative AI. The program is designed for professionals, such as consultants and business and engineering managers, who aim to advance their careers by exploring the applications of generative AI in their area of expertise within a research setting. It is a fun way to stay involved in teaching and research!

 

Any advice for students starting out?

Get involved. Join a club or sport activity, TA for a course, do a co-op or an internship, but also grab a coffee with your classmates or labmates after a class or a meeting, book a room on campus and watch a game with your buddies, whatever it might be. 

Don’t pass up on opportunities to connect with people – it will help you get out of your own head! 

One thing I am glad I did in my education is that I got involved in a whole bunch of colourful activities at UBC over the years, and I like to think it totally paid off – I enjoyed myself! I served as the Graduate Vice Chair of the UBC Women in Engineering, and was the President of the UBC Electrical and Computer Engineering Graduate Student Association (ECEGSA) and a member of the UBC Persian Club. We hosted parties for Nowruz (Persian New Year) and Yalda (Persian celebration of the winter solstice), fundraising events, deep learning workshops, and screenings of soccer matches! With my friends and classmates, we would do after-hour jam sessions at the student lounge, karaoke nights, belly dancing classes, and mid-day walks down to Wreck Beach! It made my school years very fun, and it also taught me how to relate and interact with a wide range of people, how to communicate my ideas more effectively (which comes in handy in fields like engineering, where communicating abstract and complex ideas can be challenging), and how to hold small talk before an important corporate meeting starts! My point being, beyond the coursework and the ambition, find ways to connect – it’s worth it!

 

LinkedIn

Discover UBC Applied Science Alumni

Whether you’re a nursing, planning, architecture or engineering graduate, the alumni network is one of the most powerful benefits of your UBC education.

Get Connected

UBC is located on the traditional, ancestral and unceded territories of the xʷməθkʷəy̓əm people (Musqueam; which means 'People of the River Grass') and Syilx Okanagan Nation. The land has always been a place of learning for the Musqueam and Syilx peoples, who for millennia have passed on their culture, history and traditions from one generation to the next.

UBC Crest The official logo of the University of British Columbia. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. Caret An arrowhead indicating direction. E-commerce Cart A shopping cart. Time A clock. Chats Two speech clouds. Facebook The logo for the Facebook social media service. Social Media The globe is the default icon for a social media platform. TikTok The logo for the TikTok social media platform. Calendar Location Home A house in silhouette. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. Linkedin The logo for the LinkedIn social media service. WhatsApp The logo for the WhatsApp social media service. Location Pin A map location pin. Mail An envelope. Telephone An antique telephone. Play A media play button. Search A magnifying glass. Arrow indicating share action A directional arrow. Speech Bubble A speech bubble. Star An outline of a star. Twitter The logo for the Twitter social media service. Urgent Message An exclamation mark in a speech bubble. User A silhouette of a person. Vimeo The logo for the Vimeo video sharing service. Youtube The logo for the YouTube video sharing service. Future of work A logo for the Future of Work category. Inclusive leadership A logo for the Inclusive leadership category. Planetary health A logo for the Planetary health category. Solutions for people A logo for the Solutions for people category. Thriving cities A logo for the Thriving cities category. University for future A logo for the University for future category.