Linear array of photodiodes to track a human speaker for video recording
dc.contributor.author | DeTone, D. | en_US |
dc.contributor.author | Neal, H. | en_US |
dc.contributor.author | Lougheed, R. | en_US |
dc.date.accessioned | 2013-06-28T15:25:57Z | |
dc.date.available | 2013-06-28T15:25:57Z | |
dc.date.issued | 2012 | en_US |
dc.identifier.citation | DeTone, D.; Neal, H.; Lougheed, R. (2012). "Linear array of photodiodes to track a human speaker for video recording." Journal of Physics: Conference Series 396(6): 062005. <http://hdl.handle.net/2027.42/98636> | en_US |
dc.identifier.uri | http://stacks.iop.org/1742-6596/396/i=6/a=062005 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/98636 | |
dc.description.abstract | Communication and collaboration using stored digital media has attracted growing interest across business, government and education in recent years. This is due primarily to improvements in the quality of cameras and the speed of computers. An advantage of digital media is that it can serve as an effective alternative when physical interaction is not possible. Video recordings that allow viewers to discern a presenter's facial features, lips and hand motions are more effective than videos that do not. To attain this, the speaker must occupy a significant portion of the captured pixels. However, camera operators are costly and often do an imperfect job of tracking presenters in unrehearsed situations. This creates motivation for a robust, automated system that directs a video camera to follow a presenter as he or she walks anywhere in the front of a lecture hall or large conference room. Such a system is presented. The system consists of a commercial, off-the-shelf pan/tilt/zoom (PTZ) color video camera, a necklace of infrared LEDs and a linear photodiode array detector. Electronic output from the photodiode array is processed to generate the location of the LED necklace, which is worn by a human speaker. A computer then controls the video camera movements to record video of the speaker. The speaker's vertical position and depth are assumed to remain relatively constant, so the video camera is sent only panning (horizontal) movement commands. The LED necklace is flashed at 70 Hz at a 50% duty cycle to provide noise-filtering capability. The benefit of using a photodiode array rather than a standard video camera is its higher frame rate (4 kHz vs. 60 Hz). The higher frame rate allows for the filtering of infrared noise such as sunlight and indoor lighting, a capability absent from other tracking technologies. The system has been tested in a large lecture hall and is shown to be effective. | en_US |
dc.publisher | IOP Publishing | en_US |
dc.title | Linear array of photodiodes to track a human speaker for video recording | en_US |
dc.type | Article | en_US |
dc.subject.hlbsecondlevel | Physics | en_US |
dc.subject.hlbtoplevel | Science | en_US |
dc.description.peerreviewed | Peer Reviewed | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/98636/1/1742-6596_396_6_062005.pdf | |
dc.identifier.doi | 10.1088/1742-6596/396/6/062005 | en_US |
dc.identifier.source | Journal of Physics: Conference Series | en_US |
dc.owningcollname | Physics, Department of | |
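The noise-filtering idea in the abstract (an LED flashed at 70 Hz, observed by a detector sampled at 4 kHz) can be illustrated with a minimal sketch. The abstract does not describe the authors' actual filter, so the lock-in style correlation below is an assumed stand-in, not their method. The window length, the function names (`led_strength`, `locate_led`, `synthetic_frames`), the 120 Hz lamp flicker and all signal amplitudes are hypothetical choices made for illustration only.

```python
import math

SAMPLE_RATE_HZ = 4000  # photodiode array frame rate (from the abstract)
LED_RATE_HZ = 70       # LED flash rate (from the abstract)
WINDOW = 400           # assumption: 0.1 s window, i.e. 7 whole LED periods


def led_strength(samples):
    """Magnitude of the 70 Hz component in one pixel's time series."""
    i_sum = 0.0
    q_sum = 0.0
    for n, value in enumerate(samples):
        phase = 2.0 * math.pi * LED_RATE_HZ * n / SAMPLE_RATE_HZ
        i_sum += value * math.cos(phase)  # in-phase correlation
        q_sum += value * math.sin(phase)  # quadrature correlation
    return math.hypot(i_sum, q_sum) / len(samples)


def locate_led(frames):
    """Return the index of the pixel with the strongest 70 Hz component.

    frames: list of WINDOW frames, each a list of pixel intensities.
    """
    num_pixels = len(frames[0])
    strengths = [
        led_strength([frame[p] for frame in frames]) for p in range(num_pixels)
    ]
    return max(range(num_pixels), key=strengths.__getitem__)


def synthetic_frames(num_pixels, led_pixel):
    """Hypothetical test data: steady sunlight, 120 Hz lamp flicker, one LED."""
    frames = []
    for n in range(WINDOW):
        t = n / SAMPLE_RATE_HZ
        led_on = (t * LED_RATE_HZ) % 1.0 < 0.5  # 50% duty cycle
        lamp = 2.0 * (1.0 + math.sin(2.0 * math.pi * 120.0 * t))
        frame = []
        for p in range(num_pixels):
            # Steady background, deliberately brighter on one side of the array
            sunlight = 5.0 if p < num_pixels // 2 else 1.0
            signal = 0.5 if (p == led_pixel and led_on) else 0.0
            frame.append(sunlight + lamp + signal)
        frames.append(frame)
    return frames


if __name__ == "__main__":
    frames = synthetic_frames(num_pixels=64, led_pixel=40)
    print("estimated LED pixel:", locate_led(frames))
```

In this synthetic data the LED is far dimmer than the background, so picking the brightest pixel from a single frame would point at the sunlit half of the array. Correlating over a whole number of 70 Hz periods cancels the steady and 120 Hz terms, which is one plausible reason a 4 kHz frame rate helps where a 60 Hz camera could not sample the flashing at all.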