Purely AI News: For AI professionals in a hurry
July 22, 2020
Researchers from Austria propose an AI system that reads sheet music from raw images and aligns that to a given audio accurately
Previously existing solutions for this capability either rely on fixed-size, small samples of sheet music images or expect a machine-readable representation of the score derived with optical music recognition. This research, lead by Florian Henkel of Johannes Kepler University, describes a system capable of tracking a whole page of sheet music at once, interpreting musical performances of any duration from beginning to end. While trackers that only exploit a fixed-size audio input are usually unable to differentiate between repeated notes if they go past a given range, the proposed method has no issue even in scores that cover long stretches of time in the audio, the researchers claim.

Behind the scenes, such systems are usually divided into two core components: Image segmentation and audio interpretation. This framework combines the two into one architecture that is tasked with predicting a segmentation mask (consisting of all the different shapes detected and identified) for the given score image that corresponds to this currently played music. The audio is interpreted using a U-Net model architecture and that, in turn, utilizes a conditioning mechanism that directly modulates the activity of feature detectors that process the score image. This unique approach of tying these two independent pieces together makes the whole system trainable as a unit and much more efficient.

The system is real-time capable due to a constant runtime per step. It compares favorably with existing baselines on synthetic polyphonic piano music and sets the new state of the art for sheet-image-based score following in terms of temporal alignment error. But, there are still problems in generalization for actual piano records. Although the model shows a far more precise synchronization in most cases, we are seeing a decline in performance over different recording conditions. Addressing these shortcomings and more, the team says "future work will also require testing on scanned or photographed sheet images, to gauge generalization capabilities of the system in the visual domain as well". They add that "There is currently no dataset consisting of scanned sheet images with precise notehead to audio alignments, it will be necessary to curate a test set. The next step towards a system with greater capabilities is to either explicitly or implicitly incorporate a mechanism to handle repetitions in the score as well as in the performance."
LinkedIn



Aug. 2, 2020

Sample Factory, a new training framework for Reinforcement Learning slashes the level of compute required for state-of-the-art results

23
July 31, 2020

Intel joins hands with researchers from MIT and Georgia Tech to work on a code improvement recommendation system, develops "An End-to-End Neural Code Similarity System"

22
July 25, 2020

Google's tensorflow-lite framework for deep learning is now more than 2x faster on average, using operator fusion and optimizations for additional CPU instruction sets

21
July 23, 2020

Fawkes: An AI system that puts an 'invisibility cloak' on images so that facial recognition algorithms are not able to reveal identities of people without permission

20
July 21, 2020

WordCraft: A Reinforcement Learning environment for enabling common-sense based agents

18
July 20, 2020

A designer who worked on over 20 commercial projects for a year turns out to be an AI built by the Russian design firm Art. Lebedev Studio

17
July 19, 2020

Microsoft is developing AI to improve camera-in-display technology for natural perspectives and clearer visuals in video calls

16
July 18, 2020

Microsoft and Zhajiang Univ. researchers create AI Model that can sing in several languages including both Chinese and English

15
July 18, 2020

New event-based learning algorithm 'E-Prop' inspired by the Human brain is more efficient than conventional Deep Learning

14
July 17, 2020

Scientists from the University of California address the false-negative problem of MRI Reconstruction Networks using adversarial techniques

13
July 16, 2020

A new technique of exposing DeepFakes uses the classical signal processing technique of frequency analysis

12
July 16, 2020

New AI model By Facebook researchers can recognize five different voices speaking simultaneously, pushes state-of-the-art forward

11
July 15, 2020

Researchers from Columbia Univ. and DeepMind propose a new framework for Taylor Expansion Policy Optimization (TayPO)

10
July 14, 2020

Federated Learning is finally here; Presagen's new algorithm creates higher performing AI than traditional centralized learning

9
July 14, 2020

Fujitsu designed a new Deep Learning based method for dimensionality reduction inspired by compression technology

8
July 12, 2020

Databricks donates its immensely popular MLflow framework to the Linux Foundation

7
July 12, 2020

Microsoft Research restores old photos that suffer from severe degradation with a new deep learning based approach

6
July 12, 2020

Amazon launches a new AI based automatic code review service named CodeGuru

5
July 12, 2020

IBM launches new Deep Learning project: Verifiably Safe Reinforcement Learning (VSRL) framework

4
July 12, 2020

DevOps for ML get an upgrade with new open-source CI/CD library, "Continuous Machine Learning (CML)"

3
July 12, 2020

DeepMind's new open-sourced Reinforcement-Learning library, dm_control, packs a simple interface to common RL utilities

2
July 12, 2020

Learning to learn: Google's AutoML-Zero learns to evolve new ML algorithms from scratch

1