Você está na página 1de 32

Holistic Model

of Sound Perception
THOST
May 2015
Presented by: Vedanta Sutra dasa
Ideas by: HG Lila Purushottama dasa
HG Rasaraja dasa

Semantic Processing
Artificial general intelligence (AGI):
a machine can successfully perform any intellectual task that
a human being can.
from understanding language to writing award-winning books
from comprehending music to composing bandishes in different
ragas

Source: Wikipedia

Semantic Processing
What are our computers doing ??
At present, digital computers only perform token manipulation
(0/1 bits).
It does not lead to understanding or cognition.
The computers manipulate the symbols using a form of syntax
rules, without any knowledge of the symbol's semantics (that is,
their meaning).

Source: Gomatam R., 2009

State of the Art - what is lacking?


Computers can perform complex computations at lightening
speeds. E.g.,
finding logarithms
computing square roots
optimization using gradient algorithms
They do not feel exhausted or bored by this labour

State of the Art - what is lacking?


Computers can perform complex computations at lightening
speeds. E.g.,
finding logarithms
computing square roots
optimization using gradient algorithms
They do not feel exhausted or bored by this labour

But they are only performing bit operations, they way they
have been programmed

State of the Art - what is lacking?


What about AI ?
Automatic temperature control
Automatic OCR (optical character recognition)
Adaptive signal processing
Automatic music transcription
GPS navigation

State of the Art - what is lacking?


What about AI ?
Automatic temperature control
Automatic OCR (optical character recognition)
Adaptive signal processing
Automatic music transcription
GPS navigation

Only a bit more sophisticated programming


They are only doing as they have been programmed

State of the Art - what is lacking?


Automatic OCR (optical character recognition)
pixels of 0/1
learn features which are again 0/1

Source: CS, Boston University

State of the Art - what is lacking?


Hierarchy of simple computational models give rise to complex
understanding

State of the Art - what is lacking?


Hierarchy of simple computational models give rise to complex
understanding

This is REDUCTIONISM, an old argument


It has brought some empirically successful systems upto
certain accuracy, but no understanding of understanding
itself

State of the Art - what is lacking?


What about a computer program defeating a human
grandmaster in playing chess?

Source: Deutsch D., 2012

State of the Art - what is lacking?


What about a computer program defeating a human
grandmaster in playing chess?

The algorithms used by human and computer are not even


remotely similar.
The grandmaster can give exciting reasons for his steps and
can write a bestseller book on the game. But a computer does
not even know what it is doing apart from following a set of
instructions.

Source: Deutsch D., 2012

Semantic Processing
But no brain on Earth is yet close to knowing what brains do in
order to achieve any of that functionality. The enterprise of
achieving it artificially the field of artificial general intelligence
or AGI has made no progress whatever during the entire six
decades of its existence.

Source: Deutsch D., 2012

Is it possible?: two views


Machine can only act like it thinks and has a mind
AGI is not possible
Machine can think and have a mind
AGI requires a major breakthrough

I call the core functionality in question creativity: the


ability to produce new explanations

Source: Deutsch D., 2012

Goals of Semantic Processing


Creativity: the ability to produce new explanations
Causal Reasoning: for taking decisions

Current Sound Processing


We sing
Air column vibrates
Diaphragm of microphone
vibrates
Vibration converted to
electric signal

Electric signal moves the coil


Diaphragm of speaker
vibrates
Air column vibrates
We hear

Current Sound Processing


We are processing in the visual domain, not the sound domain
[Sir]
Even touch is playing an intermediary role
Corollary: frequency, phase are an artifact

Can we process sound in the sound domain?


Law of the Instrument:
if all you have is a hammer, everything looks like a nail
- Abraham Maslow

Quantifying Experiences
Experience is irreducible/holistic
Modern science tries to reduce it to elementary entities. Why?

Quantifying Experiences
Experience is irreducible/holistic
Modern science tries to reduce it to elementary entities. Why?
May be the main motivation for this is to be able to QUANTIFY
the experiences.
Quantification gives better
understanding (notes of music/raga)
portability (reproducing the experience)
control (manipulability)

E.g., the experience of seeing an elephant can be represented as numerical values


corresponding to color, shape, size, etc. Once this is done, it opens the world of unlimited ways
this information can be utilized, be it for image processing, object detection, tracking,
animation and so on.

A Gap in the Understanding


The machine can do speech recognition, then how are we
different?
Machine can do symbol manipulation, not semantic one.
It can recognize symbols/shapes, but it does not know what to
recognize; we have to tell it what to look for.
But humans can automatically detect what to recognize or
learn, because they know the meaning associated
E.g. One can speak a sentence in many ways - low pitch, high
pitch, whisper, alaryngeal etc. But no one has to train me how
to make sense out of it. But a system has to be trained
separately that this signal (symbol) corresponds to this
phoneme (another symbol)
Humans can reach meaning even with fragmentary input

A GAP in the Understanding


The modern sound processing methods can manipulate lower
level handles like frequency, phase, etc.
Music theory can manipulate/explain higher level concepts like
instrumentation, pitch, etc.
There is a big gap in between
No single theory can explain all the levels coherently

Goals of Semantic Processing


Creativity: the ability to produce new explanations
Causal Reasoning: for taking decisions
Single theory for explaining different levels of perception
E.g. a note is important because it gives a different form to the
composition, and not because of statistics of notes

How is it possible?
Form vs Substance
Every instrument has a form which identifies it. This form
manifests through the audio that we hear.
A message has a form which manifests through the alphabets
written on paper
Form is simultaneously same and different from the substance
Modern science is good in manipulating substance, but no idea
of form

Source: Gomatam R., 2014

How is it possible?
Hierarchy of form
Alphabet, word, sentence, article, idea
Note, composition, raga, idea/mood
Similar to the levels of sabda in Vedic understanding
The higher level guides the lower level, not the other way
round

Modeling the form


Let the form be modeled by a mathematical entity called infon
Infon carries the information from the source to the perceiver

Modeling the form


Let the form be modeled by a mathematical entity called infon
Infon carries the information from the source to the perceiver

SOURCE
INFON
PROJECTED FROM
PERCEPTUAL
SPACE

OBSERVER

GROSS MEDIA
CARRYING INFON

INFON
PROJECTED TO
PERCEPTUAL
SPACE

Modeling the form


An Infon is a relationship - between different objects, sense
experiences
Model (MQM?)
infon as a wave function
information transfer as projection operator
INFORMATION is a scalar measure associated with an INFON

Modeling the Experience


Earth, water, fire, etc. are gross media to carry infons
The medium has some infon (form)
The source puts its infon onto the medium, resulting in a
different infon
The observer has own perceptual space
The infon in medium gets projected onto the perceptual space
of the observer

How to define Information?


According to the present understanding, information is defined
w.r.t. the quantity the observer is interested in
E.g. in Shannons information, the probabilities are the ones
associated with the patterns the receiver is expecting
E.g. given a green tennis ball
Red vs Green
Tennis vs Ping-pong
Size

Further directions
Dimension of infon?
Can we model it completely?

Conclusion

Present computation (AI) is syntactic, not semantic


AGI requires creativity and causal reasoning
Distinction between substance from form
Model form as infon, which carries information
Experience as projection on perceptual space
Information depends on what the perceiver is interested in
Higher level in the hierarchy guiding the lower level

Bhagavat Sankhya approach of


H.G. Rasaraja Prabhu
Syntactic TOKENS to Semantic SYMBOLS
A different world view: we are interpreting things as TOKENS, but
they can also be conceived of as SYMBOLS
physically characterize macroscopic objects in terms of their
relational properties, as SYMBOLS with semantic content

Source: Gomatam R., 2009

Você também pode gostar