source code - http://archive.pkmital.com https://archive.pkmital.com computational audiovisual augmented reality research Fri, 06 Feb 2015 05:45:51 +0000 en-US hourly 1 https://wordpress.org/?v=6.9.4 Handwriting Recognition with LSTMs and ofxCaffe https://archive.pkmital.com/2015/02/06/handwriting-recognition-with-lstms-and-ofxcaffe/ https://archive.pkmital.com/2015/02/06/handwriting-recognition-with-lstms-and-ofxcaffe/#respond Fri, 06 Feb 2015 04:41:49 +0000 http://pkmital.com/home/?p=1836

Long Short Term Memory (LSTM) is a Recurrent Neural Network (RNN) architecture designed to better model temporal sequences (e.g. audio, sentences, video) and long range dependencies than conventional RNNs [1]. There is a lot of excitement in the machine learning communities with LSTMs (and Deep Minds’s counterpart, “Neural Turing Machines” [2], or Facebook’s, “Memory Networks” [3]) as they overcome a fundamental limitation to conventional RNNs and are able to achieve state-of-the-art benchmark performances on a number of tasks [4,5]:

  • Text-to-speech synthesis (Fan et al., Microsoft, Interspeech 2014)
  • Language identification (Gonzalez-Dominguez et al., Google, Interspeech 2014)
  • Large vocabulary speech recognition (Sak et al., Google, Interspeech 2014)
  • Prosody contour prediction (Fernandez et al., IBM, Interspeech 2014)
  • Medium vocabulary speech recognition (Geiger et al., Interspeech 2014)
  • English to French translation (Sutskever et al., Google, NIPS 2014)
  • Audio onset detection (Marchi et al., ICASSP 2014)
  • Social signal classification (Brueckner & Schulter, ICASSP 2014)
  • Arabic handwriting recognition (Bluche et al., DAS 2014)
  • TIMIT phoneme recognition (Graves et al., ICASSP 2013)
  • Optical character recognition (Breuel et al., ICDAR 2013)
  • Image caption generation (Vinyals et al., Google, 2014)
  • Video to textual description (Donahue et al., 2014)

The current dynamic state … Continue reading...

The post Handwriting Recognition with LSTMs and ofxCaffe first appeared on http://archive.pkmital.com.

]]>
https://archive.pkmital.com/2015/02/06/handwriting-recognition-with-lstms-and-ofxcaffe/feed/ 0
Short Time Fourier Transform using the Accelerate framework https://archive.pkmital.com/2011/04/14/short-time-fourier-transform-using-the-accelerate-framework/ https://archive.pkmital.com/2011/04/14/short-time-fourier-transform-using-the-accelerate-framework/#comments Thu, 14 Apr 2011 11:06:55 +0000 http://pkmital.com/home/?p=591 Using the libraries pkmFFT and pkm::Mat, you can very easily perform a highly optimized short time fourier transform (STFT) with direct access to a floating-point based object.

Get the code on my github:
http://github.com/pkmital/pkmFFT
Depends also on: http://github.com/pkmital/pkmMatrixContinue reading...

The post Short Time Fourier Transform using the Accelerate framework first appeared on http://archive.pkmital.com.

]]>
https://archive.pkmital.com/2011/04/14/short-time-fourier-transform-using-the-accelerate-framework/feed/ 3
Real FFT/IFFT with the Accelerate Framework https://archive.pkmital.com/2011/04/14/real-fftifft-with-the-accelerate-framework/ https://archive.pkmital.com/2011/04/14/real-fftifft-with-the-accelerate-framework/#comments Wed, 13 Apr 2011 23:05:05 +0000 http://pkmital.com/home/?p=576 Apple’s Accelerate Framework can really speed up your code without thinking too much. And it will also run on an iPhone. Even still, I did bang my head a few times trying to get a straightforward Real FFT and IFFT working, even after consulting the Accelerate documentation (reference and source code), stackoverflow (here and here), and an existing implementation (thanks to Chris Kiefer and Mick Grierson). Still, the previously mentioned examples weren’t very clear as they did not handle the case of overlapping FFTs which I was doing in the case of a STFT or they did not recover the power spectrum, or they just didn’t work for me (lots of blaring noise).

Get the code on my github:
http://github.com/pkmital/pkmFFTContinue reading...

The post Real FFT/IFFT with the Accelerate Framework first appeared on http://archive.pkmital.com.

]]>
https://archive.pkmital.com/2011/04/14/real-fftifft-with-the-accelerate-framework/feed/ 24