Running your first application
Input types
Input on HoloLens is different from other platforms. Unlike other platforms, the primary means of interaction are three non-traditional systems:
-
Gaze: Where the user is looking in the world.
-
Gesture: Hand signals used to signify commands to the system.
-
Voice: Short, spoken commands, and longer free-form dictation.
Gaze
The Gaze refers to where the user is looking. On HoloLens, this is fairly precise and can be used to select GameObjects in the world. This can be used to direct commands at specific GameObjects rather than every GameObject in the Scene.
Microsoft provides useful documentation on Gaze indicator and Gaze targeting pages.
Gesture
A Gesture is a hand signal interpreted by the system. These can be used to represent commands. There are several built-in Gestures you can use in your application, as well as a generic API to recognize custom Gestures. Both built-in Gestures and custom Gestures (via API) are functional in Unity.
Built-in Gestures:
-
Tap: With a closed hand and extended thumb and forefinger, press thumb and forefinger together. This is commonly used as a select command on the HoloLens.
-
Double Tap: Two Tap Gestures in rapid succession.
-
Hold: A Tap Gesture, keeping the forefinger and thumb together for one second or more.
-
Manipulation: A Hold Gesture, followed by a translation in space. Distances from the Hold position are reported in the Gesture.
-
Navigation: A Hold gesture, followed by a translation in space. This may be constrained to one or more of the x, y, and z planes, and reports a value from –1 to 1 for each axis.
For more information about gestures, refer to Microsoft’s documentation on Gesture design.
Voice
Voice input on HoloLens is provided by the Windows 10 API. Unity supports three styles of input:
-
Keywords: Simple commands or phrases set up in code which generate events. This allows you to quickly add voice commands to an application where localization is not an issue. This functionality is provided by the KeywordRecognizer.
-
Grammars: A table of commands with semantic meaning which you can localize. Grammars are configured from an xml grammar file (.grxml). See Microsoft’s documentation on Creating Grammar Files for more information on the file format. Grammar recognition functionality is provided by the GrammarRecognizer.
-
Dictation: A more free-form text-to-speech system that translates longer spoken input into text. Dictation recognition on the HoloLens is only active for short periods of time to prolong battery life. Dictation requires a working Internet connection and is provided by the DictationRecognizer.
For more information about voice input, refer to Microsoft’s documentation on Voice design.
Did you find this page useful? Please give it a rating:
Thanks for rating this page!
What kind of problem would you like to report?
Is something described here not working as you expect it to? It might be a Known Issue. Please check with the Issue Tracker at issuetracker.unity3d.com.
Thanks for letting us know! This page has been marked for review based on your feedback.
If you have time, you can provide more information to help us fix the problem faster.
Provide more information
Running your first application