首页 / 专利库 / 电脑零配件 / 接口 / 用户界面 / 字符用户界面 / Multi-media handling system

Multi-media handling system

阅读:726发布:2021-12-24

专利汇可以提供Multi-media handling system专利检索,专利查询,专利分析的服务。并且A text processing system has display of text as a page of characters. The user interface comprises a set of editing actions that are activatable by mouse and/or keyboard actuations. Furthermore, secondary display modes are provided for audio or fax, which can be edited by at least a subset of the text editing actions. Representation of audio is as pseudo characters.,下面是Multi-media handling system专利的具体信息内容。

A text processing system comprising pagewise text display means as based on a character representability, supported by text processing means and text storage means and having a user interface featuring a set of text manipulating actions activatable through mechanical user actuation of an appropriate input means, characterized in that said display means has an audio display mode as based on linewise pseudo character representation and in said audio display mode featuring at least a first subset of the set of text manipulating actions operating on displayed audio in similar way as on displayed text and each of the latter actions having a mapped effect on the actual stored audio corresponding to displayed pseudo characters.A text processing system as claimed in Claim 1, wherein said audio is voice at a linewise display scale that is comparable to text display, and comprising delimiters as signalizations of detected silence instants.A text processing system as claimed in Claim 1 or 2, wherein said display means has a fax display mode as based on linewise representation of said fax and in said fax mode featuring at least a second subset of the set of text manipulating actions operating on a displayed fax area in similar way as on displayed text.A text processing system comprising pagewise text display means as based on a character representability, supported by text processing means and text storage means and having a user interface featuring a set of text manipulating actions activatable through mechanical user actuation of an appropriate input means, characterized in that said display means has a fax display mode as based on linewise facsimile representation and in said fax display mode featuring at least a second subset of the set of text manipulating actions operating on displayed fax in similar way as on displayed text and each of the latter actions having a mapped effect on actual stored facsimile information corresponding to said facsimile representation.A text processing system as claimed in Claim 3 or 4 having skew correcting means for under user control correcting skew of a particular line of fax.A text processing system as claimed in Claim 3 or 4 and having orientation detection means for detecting a local orientation of a displayed fax text line for allowing word and/or line select by said actions along a so detected line.A text processing system according to any of Claim 3 to 6, wherein said fax is linewise represented as further pseudo characters.A text processing system as claimed in any of Claims 3 or 5 to 7, wherein said first and second subsets are identical.A text processing system as claimed in any of Claims 1 to 8, wherein said pseudo characters have two variable attributes assigned thereto, such as size and colour.
说明书全文

FIELD OF THE INVENTION

The invention relates to a text processing system comprising pagewise text display means as based on a character representability, supported by text processing means and text storage means and having a user interface featuring a set of text manipulating actions activatable through mechanical user actuation of an appropriate input means. Various professional and personal computer ranges have text processing systems. An exemplary reference is the "Common User Access Advanced Interface Design Guide" for the IBM Operating System/2, by IBM, June 1989, Document SY0328-300-R00-1089, pages 33-42. Text is represented as characters according to one or more formats, free-formatable, connected or non-connected. Manipulating actions have been designed in a wide variety as will be described hereinafter. In this way the formatting of the text can be effected in many ways on the level of character, word, line, block, window, pararaph, page or other, while various actions such as insert, delete, cut, clip, paste and others are useful. In the effecting of those actions, the multilevel structure of representation of the text has been found useful. The most commonly used input devices based on such mechanical user actuation are hard keyboard and mouse. However, various other devices could serve, such as tracker ball, touch screen, or writing stylus.

SUMMARY OF THE INVENTION

Amongst other things, it is an object of the invention to support representation of other communication media and their processing by means of an interface that to an appreciable extent is uniform among said various media. According to one of its aspects, the invention realizes this object by representing such media as a text metaphore in that said display means has an audio display mode as based on linewise pseudo character representation and in said audio display mode featuring at least a first subset of the set of text manipulating actions operating on displayed audio in similar way as on displayed text and each of the latter actions having a mapped effect on the actual stored audio corresponding to displayed pseudo characters. The time sequence of audio governs the progression of its display along successive lines. For easy accessibility the audio signal is represented as pseudo-characters such as tiny blocks with height according to audio intensity. In particular, space characters function as delimiters between contiguous audio segments that are separated by a pause. Such a pause can be detected as occurring in the audio input, such as the end of a word or sentence. In an alternative realization, any pause character is a default item in that selectively, audio may be input into selective positions, i.e. time instants, such as in filling in a form or questionary with audio answers. In this way, audio is indeed processed as a text metaphore. The correspondence among the various activations makes life easier for any experienced user. On the other hand, the mapping of various manipulating operations on various input devices need not be uniform among text and audio. And een for a single object, certain manipulations may be restricted to a particular input device as differing for othe manipulations and/or another object.

Advantageously said audio is voice at a linewise display scale that is comparable to text display, and comprising delimiters as signalizations of detected silence instants. The duration of a word or line of text would then correspond in display space to the display of that many characters in textual representation. Their correspondence in physical space would be readily visible to a user. The silence delimiters correspond to ends of words or sentences in a written text.

Advantageously said display means has a fax display mode as based on linewise representation of said fax and in said fax mode featuring at least a second subset of the set of text manipulating actions operating on a displayed fax area in similar way as on displayed text. Fax is defined as a text represented by a bit map, wherein the character/word content or format may be uncertain in any degree, and wherein in principle, any two-dimensional format is allowable. It has been found that composing in a fax page would be advantageous as well, such as transmitting a text together with non-textual material, supplementing received fax with typewritten text, moving parts of a facsimile page with respect to other parts, and many others. The subsets of actions relating to audio and fax pages, respectively, may be, but need not be identical. The difference between text and fax information, is that the system does not know the meaning of the fax information. Even with a character recognition feature, only an approximate knowledge is present.

Advantageously, said system has skew correcting means for under user control correcting skew of a particular line of fax. Word select is possible if the skew is small over the word length, which usually is the case. Line select is possible if the skew over the line length is small compared to the line height. If this is not the case, the operator may manually correct the skew; in-processor algorithms that are rather elementary have been described in US Patents 4,618,991 and 4,759,076. No editing in the sense of the present invention has been given by these references, however.

Advantageously, said system has orientation detection means for detecting a local orientation of a displayed fax text line for allowing word and/or line select by said actions along a so detected line. Whereas the earlier approach works if the adjustment required is not too great (say, up to some 20°) and is more or less uniform over a page, there could occur problems if the skew is very large (up to 90°) and/or is severely non-uniform over a page, such as would occur in handwriting. In that case, the orientation of the line is detected, such as has been described in US Patent 4,723,297, which may, in principle, be done separately for each line. The detection may be followed by correction, if feasible, or the editing may be executed directly on the slanting uncorrected line.

Various advantageous aspects are recited in dependent Claims.

BRIEF DESCRIPTION OF THE FIGURES

The invention will be further disclosed according to the appended Figures that successively show:

  • Figure 1 a general diagram of a computer system;
  • Figures 2-7 various display formats;
  • Figures 8-9 tables of actions.

DESCRIPTION OF A PREFERRED EMBODIMENT

Figure 1 is a general diagram of a keyboard-oriented computer system. There is a page display 20. Shown are a foreground window 22, a partially covered window 24 and a cursor 26. The display may be based on a standard character set, or it may be high definition in that for each character an arbitrary bit map may be chosen. Processor 28 may execute text processing as well as system software. Memory 30 stores data to be displayed and other data. Keyboard 32 receives user actuations to be translated into system actions. Mouse 34 can be used to input absolute or differential positions in conjunction with clicking actions on any of four buttons 42. Block 44 represents a microphone with associated conversion mechanism to n-bit audio samples, that may be encoded, modulated, filtered, and, as the case may be, used in a voice recognition subsystem. The value of n of course, as well as the frequency of the samples, audio processing and other technology relevant to digital audio, relate to intended quality. Block 46 represents a loudspeaker that may output digital received audio samples via error correction, demodulation and digital-to-analog conversion to continuous audio. In addition to this, audio may remain analog if editably stored on a suitable medium. In principle, all audio editing may be done on an analog level, such as replacing, inserting, dubbing, fading and superposing, inasmuch as the invention proper relates to the interfacing with signalization of the audio. In the Figure, no analog storage has been provided and all operation is digital. Further communication lines 36, 38, 40 interconnect the subsystems shown. The configuration shown can be expanded or shrunk to any feasible degree, by means of:

  • background memory
  • external data communication
  • adapted functionality of subsystems
  • other I/O, such as graphics I/O (also called -mouse- hereinafter, such as soft keys, joystick)
  • other storage, such as tape
  • other communication pattern, instead of 36, 38, 40.

For brevity, Figure 1 does not show the conversion of audio to pseudo characters. According to various discrimination thresholds, for the average audio amplitude, the pseudo-characters in the present embodiment may have any of four non-zero heights as shown in Figures 2-7. Of course, other numbers are feasible. The translation of a particular digitized amplitude to a geometric representation is by converting amplitude to an address code for a character generator that generates a pseudo-character in the same way as standard characters. Other representations are feasible, such as dots of various diameters (the diameter now linked to amplitude), blocks with various greyness levels and/or colours, or other. The mapping of the audio on the pseudo-characters may be two-dimensional, such as by discriminating both audio level and audio pitch. This would, for example, elucidate a dialog between a male voice and a female voice. The mapping coordinates could now be both size and colour of the pseudo-characters.

Figures 2-7 show various display formats. Figure 2 is a text format, such as according to 53 rows of 65 characters each. In the example, characters are in print, with left-side and right-side registration, uniform periodicity and uniform line height The content of this page is irrelevant to the invention as well as to the disclosure thereof. Now, the format need not be uniform. It may be given by a shape description instead of by the shape itself, such as in the well-known Postscript® system.

Figure 3 is a voice format. Each character represents a voice period of about 1/20 of a second. The height of a block corresponds to the amplitude or energy of the voice segment. The pseudo characters have about the same width as the characters of Figure 2. An interrupted line indicates the connected sequence. Other representations are possible. Colour could indicate raised pitch such as at the end of a questioning phrase. Also other voice characteristics may be indicated such as the respective speakers in a conversation that each have their own colour. Apart from the height other shapes of the pseudo characters would be feasible to indicate various natural or artificial characteristics. Examples are marks for end-of-sentence, echo, machine generated speech versus recorded speech. The representation can be used for voice, music, animal sound, machine sound, background noise from machines, natural noise, and other.

Figure 4 is a fax format that is organized according to standard horizontal lines and provided with word rasters. The fax may be represented as it is received or produced. Alternatively, it may be represented as a string of blocks of varying lengths. Such block that may itself be considered as a pseudo-character, may be detected by a known segmentation algoritm. These pseudo-characters may be represented in similar way as has been explained with respect to audio herebefore. The content of the fax message itself is irrelevant to the present invention as well as to the disclosure thereof. The advantage of using such pseudo characters is, for example that the content of the fax may remain unknown to an operator (who only selects particular parts, or executes other text processing for which the content of the text need not be known). Another example is that such representation discretely shows the empty spaces even if the full text could give rise to ambiguities. The raster shown may be omitted if advantageous. The representation may be a combination of the fax text itself in combination with the block structure. It may be feasible to alternate between the two representations. The case of slanted lines has been discussed earlier. The fax may comprise a combination of text or texts (at respective slanting angles) and non-textual material such as Figures.

Figure 5 shows at left a text page with a so-called audio sticker on it, representing a non-open voice sheet in iconized state. In this state, it is possible to play the stored voice relating to this sheet and also to record new voice (or other audio), which is thereupon joined to this sheet. The activating of this functionality is effected by accessing of icon 50 that shows a stylized quote mark. However, only an open sheet leads to full editing functionality. The right hand side of the display shows various items and windows that are irrelevant to the present invention and are mousewise accessible.

Figure 6 shows an opened voice sheet, cf. the downstepping of the dark strip at right-hand side through the select column, with respect to Figure 5. This open sheet allows full-featured editing. Instead of a text font there is a block font indicating the amplitudes. There is a cursor in the left upper corner in the same way as a text cursor. New input (by recording) is inserted at its positions. Just as with text the user can select some part and use the cut, copy and paste operations. This makes it an audio editor, being much more than a cassette recorder. The cursor can be placed at a special position by a mouse click. But the user can also make use of the feature, that the cursor moves during play. Pressing "stop" will leave the cursor at the current position.

If there is some part selected, the command is restricted to that selection. E.g. the user first wants to listen to a selection before deciding to cut it. Figure 7, by inverted video shows selection of a segment that has about 6½ lines of pseudo characters.

Figures 8-9 show tables of various actions. Figure 8 shows the actions and extent of selection. In the left column it is shown that selecting may be done by a particular clicking operation. Clicking a particular mouse button starts a brief delay, for example 500 milliseconds. If within this delay a further clicking occurs, the number of clicks is accumulated. If no clicking occurs within this delay, a subsequent clicking resets the accumulated sum (to one). The maximum accumulation is three in this example. The right column shows the effect. Single click selects the actual cursor position. Double click selects the word or pseudo word of the actual cursor position, that is, to left and to right the first space character act as delimiters to the selection. Triple click likewise selects the whole line of characters or pseudo characters. The last line shows the effect of dragging. Dragging is the same as clicking but holding the clicking button while moving the mouse (or rather, its on-screen representation) to an intended destination position. If the button is accidentally deactivated somewhere, that is the destination position. Dragging selects all (pseudo) characters between the start position and the destination position: this means all characters along the line of the start position to the right of the start position and all characters on the line of the destination position to the left of the destination position, and all intermediate lines completely. If the two positions are on one single line, only the interval on that line is selected. In principle, dragging could be combined with plural clicking, but this has not been considered.

Now, Figure 9 shows what happens with the amount of (pseudo) characters after selection according to the pattern of Figure 8. The set of commands in the left hand column may be activated by a particular dedicated key or button on the keyboard, by a software defined key on the keyboard, or by a soft key on a displayed soft keyboard. The set of commands is as follows:

  • a. type/record: the characters in the selected region are deleted from the page and the newly typed or recorded characters are inserted in the actual cursor position whil continuously keeping actual text closed-up.
  • b. command CUT: this deletes all (pseudo) characters selected, closes up the (pseudo) text, and puts the selected information on the so-called CLIPBOARD, which is a particular dedicated memory area that is always directly accessible from the keyboard. If a plurality of windows have been activated in succession, the information present in the clipboard always remains linked to the window that did originate such information. The clipboard itself does not provide cross-window transportability. If such transportability were required, a special additional command is required to directly associate the deleted characters with the destination window that must then be operated directly. If a succession of CUT commands is given in succession, the information deleted is stacked and can be recalled in a last-in-first-out sequence. In another realization a first-in-first-out organization might be preferable, however.
  • c. command COPY: this, likewise, puts the selected information on the clipboard, but without deleting the selected information at its original location.
  • d. command PASTE: if applicable, delete the selected information (if more than one pseudo character is selected) and insert the clipboard information on the position(s) so selected. If the latter information is too large, insertion will let the amount of information grow. On the other hand, the information amount will shrink if the new information is less than the original information.
  • c. command DELETE: the selected information is deleted, and the subsequent information closes up.

Of course, the information may be manipulated in various other ways than according to the above subset of activations, but nevertheless in combination therewith. For example, fax information may be underlined. For another, audio information may be dubbed, loudness-clipped or other. This could be done, either in the context where the pseudo-characters of the associated communication medium occur, or in a completely separate setting.

In conclusion, the uniform interface for various communication media produces an easier accessibility. For brevity, the software has not been extensively disclosed, as this was considered to closely parallel that for text only, which is standard in various packages that are available commercially.

高效检索全球专利

专利汇是专利免费检索,专利查询,专利分析-国家发明专利查询检索分析平台,是提供专利分析,专利查询,专利检索等数据服务功能的知识产权数据服务商。

我们的产品包含105个国家的1.26亿组数据,免费查、免费专利分析。

申请试用

分析报告

专利汇分析报告产品可以对行业情报数据进行梳理分析,涉及维度包括行业专利基本状况分析、地域分析、技术分析、发明人分析、申请人分析、专利权人分析、失效分析、核心专利分析、法律分析、研发重点分析、企业专利处境分析、技术处境分析、专利寿命分析、企业定位分析、引证分析等超过60个分析角度,系统通过AI智能系统对图表进行解读,只需1分钟,一键生成行业专利分析报告。

申请试用

QQ群二维码
意见反馈