Skip to main content

Convolutional Neural Network: The evolution of facial expression recognition AI towards a model operating in your browser

convolutional neural network

A MorphCast secret revealed

The R&D team at MorphCast, along with two university research institutes, has created an extremely compact yet powerful AI model for facial expression recognition, capable of running in the browser of various devices such as smartphones, tablets, and PCs. This was made possible through a proprietary architecture of “deep” convolutional neural networks (CNN).

What does DNN stand for?

In machine learning and artificial intelligence, a Deep Neural Network (DNN) is a type of artificial neural network with multiple layers between the input and output layers which enables the learning of high-level abstractions from data​.

What does CNN stand for?

It is a class of deep neural networks, most commonly applied to analyzing visual imagery. In a CNN, the network employs a mathematical operation called convolution to process parts of the input data, making it especially effective in identifying spatial relationships in data like the arrangement of pixels in images. This capability makes CNNs extremely effective for tasks like computer vision applications.

What does “convolutional” mean?

In the context of a Convolutional Deep Neural Network (CNN or ConvNet), the term “convolutional” refers to the convolutional layers that are used in the network.

MorphCast Neural Networks

Positioned downstream to the MorphCast face detector is a “deep” neural network, specifically a Convolutional Neural Network (CNN). This network processes the cropped face output from the face detector, delivering results pertinent to the module in question (e.g. emotions, affects, attention, etc.).

This neural network is custom-designed, meticulously crafted to adhere to stringent size (<1 MB) and execution time requirements. Despite operating at a lower resolution, it maintains a rapid prediction timeframe (approximately 30ms on PC), alongside adequate accuracy for the designated task. Unlike other products, this setup achieves a delicate balance between accuracy and processing time, even at such reduced resolutions while maintaining a high level of accuracy.

Proprietary Architecture

The bespoke architecture of MorphCast is multi-task oriented, with a fundamental segment (the deepest levels) shared across all face analysis modules. This shared foundation not only minimizes the model size but also trims down processing time. For instance, operating four modules simultaneously instead of just one elevates the CPU or GPU load by merely 30%, a stark contrast to the anticipated 300% increase. This streamlined structure underscores the architecture’s efficiency, fostering a more resourceful processing environment.

Post-processing

The output generated by the neural network undergoes a subsequent layer of post-processing to eliminate noise and conduct beneficial temporal post-processing. This stage can be tailored to optimally extract valuable information from the data, thereby significantly enhancing the utility derived by the user. This feature sets our offering apart, as comparable products in the market lack this refined level of post-processing, underscoring the superior value proposition for our customers.

Strengths

Weaknesses

Interested in more science behind MorphCast? Read our articles on Paul Ekman’s Basic Emotions, and Russel’s Circumplex Model.

About the author

Latest from our Blog

See all articles See all articles
  • The Hidden Impact of Unseen User Stress in Digital Experiences
    AI and Humanity May 1, 2026

    The Hidden Impact of Unseen User Stress in Digital Experiences

    Why Mental Health Matters More Than We Think in Digital Interactions Every May, Mental Health Awareness Month brings attention to…

  • Purpose of artificial intelligence
    AI and Humanity AI Technology Expert Content March 22, 2026

    Purpose of artificial intelligence

    Artificial intelligence becomes truly valuable when it helps us refine our abilities, not abandon them. Used well, AI saves time,…

  • Preventing It Before It Happens: AI and Emotional Signals Against Aggression
    AI Technology Expert Content Industry News & Trends Product Innovation October 17, 2025

    Preventing It Before It Happens: AI and Emotional Signals Against Aggression

    Data analysis made from second 24 – 29 of the video Headbutts to journalist, Rai 2 crew attacked in Ostia…