ADVERTISEMENT FOR "PHILOSOPHICAL SEMANTICS": WITTGENSTEIN: 2016

What follows is a draft of the chapter on Wittgenstein from the book "Philosophical Semantics" to be published in 2016 by Cambridge Scholars Publishing.

- 3 -

WITTGENSTEINIAN SEMANTICS

Nun scheint mir aber, gibt es ausser der Arbeit des Künstlers noch eine andere, die Welt sub specie aeterni einzufangen. Es ist – glaube ich – der Weg des Gedankens, der gleichsam über die Welt hinfliege und sie so lasst, wie sie ist – sie von oben von Flüge betrachtend.

[Now it seems to me too, that besides the artist’s work there is still another way of capturing the world sub specie aeterni. It is – I believe – the way of thought, which so to speak flies over the world and leaves it as it is – observing it from above, in flight.]

Wittgenstein

My aim in this chapter is not so much to interpret Wittgenstein, as to reconstruct and sometimes revise his insights on meaning in a way that makes them more powerful and relevant than they may seem at first sight. What will be searched here is what in his own terminology could be called a surveillable representation (übersichtliche Darstellung) of the grammar of the concept-word ‘meaning’, particularly in regard to representative language. Before beginning, I would like to say something about what could be called the ‘semantic-cognitive link.’

Semantic-cognitive link

The most common viewpoint concerning the referential mechanism, which I intend to support in this book, is that referential expressions can only refer because of some intermediary link able to associate them with their reference. This view originated in classical antiquity. I defend the position that this link has a semantic-cognitive nature in the sense that it can always be considered from two contrasting perspectives: psychological and semantic.[1] From a psychological perspective, the link can be called idea, representation, intention, conception, thought, and cognition (Aristotle and Locke were examples of semanticists who adopted this perspective). From a semantic perspective, the link is more often called a sense, meaning, use, application, intension, connotation, concept, informative content, proposition, criterion, criterial or verificational rule (Stoics, Frege, Husserl and Tugendhat are examples of semanticists of this persuasion). Here is a diagram:

LANGUAGE

↓

(a) COGNITIVE LINK:

idea, representation, cognition, intention, conception, thought...

(b) SEMANTIC LINK:

sense, meaning, content of thought, intension, use, application, semantic rule, criterial rule, criteria, proposition…

↓

WORLD

At this point an old polemic arises: What is the appropriate link? Which set of terms should be excluded? Should we exclude psychological terms, so as not to contaminate semantics with empirical contingency? Or should we abandon a possible commitment to questionable abstract semantic entities, exchanging them for the more feasible concreteness of the psychological, able to justify the mental causality? Should we read the Critic of Pure Reason semantically or psychologically?

Many philosophers have reacted to this question on the assumption that each alternative excludes the other. In my view this is the big mistake. I see this assumption as a false dilemma that has generated too much philosophical confusion, for the psychological and semantic perspectives should be seen not as mutually exclusive alternatives, but as complementary and in some way indissociable ones.

The source of this illusion that the perspectives are irreconciliable lies in the fact that the abstract character of the semantic perspective is seen as committed to some form of realism about universals, while the cognitivist perspective is seen as committed to some kind of nominalism attached to the particularist contingency of the psychological. Since these ontological commitments are incompatible, the two alternatives also seem to be incompatible.

However, if we perceive that these ontological commitments are evitable, it becomes easier to conclude that the intermediary link between words and things can be approached in both ways without conflict. For this we must realize that when we consider the intermediate link from a semantic perspective we are not commited with the appeal to abstract entities in any realistic sense. For we are only leaving out of consideration the unavoidable fact that meaning is able to exist only insofar as it is spatio-temporally embodied in some specific psychological subject and treating meaning simply in abstraction of this embodiment.

To clarify the complementarity that I am suggesting: we can consider the intermediate link as both: (i) a cognitive link, consisting of elements that must be spatio-temporally realized as fortuitous intentional acts occurring in specific psychological individuals; (ii) a semantic link, referred to as something considered in abstraction from its spatio-temporal instantiations as an intentional act going on in some particular psychological individual in a definite time and space.

However, this abstraction cannot be made in a sense in which the semantic link is considered as in some way transcending the realm of specific psychological and physical beings, since it always needs some form of cognitive spatio-temporal intentional embodyment in order to be an object of consideration. Thus, the word ‘abstraction’ means here simply leaving out of consideration the natural association between a meaning and the psychophysical individuals who instantiate the meaning, and focusing on the signs that are able to convey this meaning insofar as they can be understood by some other psychological interpreter. This is the only way a semantic-cognitive link can be clearly made semantically independent of this or that cognitive instantiation.

A very simple example illustrates my point. When I recognize a patch of vermilion of cinnabar (a precisely characterized color), it is because the patch I see matches the memory image of vermilion that I stored in my long-term memory during earlier experiences. Now, when I speak of a general concept of vermilion of cinnabar, I am speaking of this image that may be made conscious in my mind, or of any other image of this color that may be made conscious in other minds,[2] insofar as these samples are qualitatively identical.[3]

In other words, against the idea that our semantic link is a type that is either a unique abstract Platonic entity or an abstract Platonic class of tokens qualitatively identical to one another, what I am proposing is that we conceive the semantic link in the sense of an arbitrarily chosen model, namely, as any token that stands for any other token that is qualitative identical to it.[4] In short, we can define a semantic link as:

A semantic link x = any occurrence of x chosen to serve as a model for any other occurrence of some x that is qualitatively identical to our model.

Since all these possible occurrences need to be psychological (and certainly also physical), we don’t need to transcend the domain of the psychological-physical in order to reach the semantic domain. Moreover, we don’t need to have an instantiation of the semantic type in any specially chosen psychological particular. What we really need is only that at least one psychological particular, regardless of which one, should embody the semantic cognition. But this condition, as we will see later, can easily be accommodated in our commonsense ontological framework.

This compromise solution is strengthened when we note that even some sub-items of (a) and (b) show an approximate correspondence to each other. Thus the psychological word ‘idea’ has meaning proximity with the semantic words ‘sense’ or ‘meaning’, as well as to ‘concept’; the psychological word ‘representation’ has meaning proximity with the semantic phrase ‘criterial rule’; the psychological word ‘mental image’ has meaning proximity with the semantic word ‘criterial configuration’; the psychological word ‘occurrence of thought’ has meaning proximity with the semantic words ‘proposition’ and ‘content of thought’.

Why meaning cannot be the reference

When we consider the semantic link, the words that more easily come to mind are ‘sense’ and ‘meaning’ (generally used as synonyms) as semantic or informational content. But what is sense or meaning? Perhaps the simplest answer is given by what may be called semantic referentialism, a doctrine that in its crudest form holds that the meaning of a linguistic expression is its own reference. This conception either denies the existence of a semantic link between word and object or minimizes its importance. Wittgenstein described this way of understanding meaning at the beginning of his Philosophical Investigations, where he commented on the so-called ‘Augustinian conception of language’:

These words, it seems to me, give us a particular picture of the essence of human language. It is this: individual words in language name objects – sentences are combinations of such names. – In this picture of language we find the roots of the following idea: Every word has a meaning. This meaning is correlated with the word. It is the object for which the word stands. (Wittgenstein 1984c, part I, sec. 1)

Wittgenstein’s aim in this passage was to object to semantic referentialism, a theory championed by himself in his first work, the Tractatus Logico-Philosophicus. According to his version of semantic referentialism, when completely analyzed, language proves to be composed of atomic propositions constituted by atomic names whose meanings would be the simple and indestructible objects necessarily referred to by them.[5]

Semantic referentialism has some intuitive appeal. After all, it is usual to explain the meaning of a word by pointing to objects that exemplify what it means. In our childhood, we learned what the word ‘chair’ means, because adults showed us examples of this artifact. And we learn the name of a particular woman because she introduces herself to us with her name. We learn what a word means or does not mean respectively through positive and negative examples of its application, which makes credible the idea that meaning is the object actually referred to. This view has at least the virtue of simplicity: ‘here is the name, there is its meaning.’[6] Notwithstanding the fact that this view has been debunked by ordinary language philosophers as based on a primitive and misleading idea of what are the mechanisms of reference and has shown itself a major source of pseudo-problems, its has been reasserted among proponents of metaphysics of reference (see, for instance, Salmon 1993).

There are strong well-known arguments against this naive referentialist view of meaning. The most obvious is that you cannot say of a meaning what you say of an object: if a pick-pocket steals my wallet I will not say that the meaning of my wallet was stolen, and to say that Abraham Lincoln was assassinated is not to say that the meaning of his name was assassinated.

Another argument is that many natural terms have the same reference, while their senses or meanings are obviously different: the singular terms ‘Socrates’ and ‘the husband of Xantippe’ point to the same man, although they clearly have different meanings. And the opposite seems to be the case with general terms: the predicate ‘...is fast’ in the statement ‘Bucephalus is fast’ allegedly refers to a singularized property of Alexander’s horse Bucephalus, and in the sentence ‘Silver is fast’ it allegedly refers to a singularized property of another horse, Silver. Although they are different horses, so that the speed of Bucephalus is not qualitatively the same as that of Silver, in both sentences the word ‘fast’ preserves precisely the same meaning.

The most decisive argument against the referentialist view of meaning, however, is more fundamental: it concerns the fact that even when a referential expression has no reference, it does not lose its meaning. The singular term ‘Eldorado’ and the general term ‘phlogiston’ do not have any reference, but by no means do they lack a meaning.

The failure of Russell’s atomistic referentialism

Well aware of difficulties like these, Bertrand Russell attempted to defend semantic referentialism in a minimalist fashion, taking into account only alleged atomic elements of language and the world. It is instructive to consider his attempt. For Russell, the meaning of at least some terms – called by him logically proper names – would have as their proper meanings their objects of reference. This could be the case, perhaps, for the word ‘red’. After all as he noted, a blind man is unable to learn its meaning (Russell 1994, pp. 194-5, pp. 201-2).[7]

However, it is untenable that the meaning of any word can be reduced to its reference tout court. Changing his example a little, suppose that someone demonstratively applies the word ‘vermilion’ to an occurrence of vermilion of cinnabar, which is a shade of red that the human eye practically cannot further subdivide. In this sense it is a simpler candidate for ‘simple’ than Russell’s red color, since it does not need to include gradations and rule its limits. Could such an occurrence be the meaning? There is an obvious reason to think that an occurrence of vermilion could not be its meaning: the absence of any identity criteria. When we consider the occurrence of vermilion – be it physically thought of as an externally given spatio-temporal aspect or property, or phenomenally thought of as an appearance, a sense-datum – the occurrence will always be different for each new experience. Thus, if the meaning of ‘vermilion’ is nothing but a detected occurrence, then each new occurrence of vermilion should be a new and distinct meaning – an intolerable conclusion!

Russell should have seen this problem, for he found a way to defend his view against such objections. But the cost of this was only of getting entangled in even worse difficulties. He suggested that the object-meaning of a logically proper name would be something immediately accessible – such as sense data picked out by pronouns like ‘this’ or ‘that’ – only as long as we maintain these sense data in our consciousness. This means that also the meaning only lasts as long as our personal experience of a word’s object of application! (Russell 1994, pp. 201, 203). But this is an extremely problematic answer, as it is clear that it leads to solipsism.[8] For what criteria of correction could we apply to fix this meaning, in order to see that reapplication of the same word to another occurrence of the sense data that would at least qualitatively be of the same sense data? Moreover, how could we insert this fugitive meaning of a proper name in our common language – a language composed of words whose meanings are permanently shared under the speakers?

Indeed, in our language, to know the meaning of a word like ‘vermilion’ demands at least the ability to recognize an occurrence of vermilion as being precisely similar to other occurrences of vermilion. But this recognition is not included in the idea that the meaning of the word is nothing but the occurrence of its reference. The concept of a term’s meaning requires essentially that this term should unify its different applications to the same referent, which is not in question here.

It is true that if the meaning of a word like ‘vermilion’ were the vermilion-type – understood as an abstract entity common to all occurrences (tokens) – we would be able to solve the difficulty pointed out above. But this solution would commit us to some form of realism or Platonism, raising justified suspicion of an unintelligible reification of the type into a topos atopos.

One alternative would be to consider the vermilion-type as being the class of occurrences of sense data that are precisely similar to each other. This reduces the risk of realism, but does not eliminate it, since classes are often seen as abstract entities. In addition, classes may be larger or smaller according to their number of members, while the meaning of the word ‘vermilion’ has no proper magnitude, neither increasing nor decreasing.

The most feasible alternative seems to be that we consider the meaning of ‘vermilion’ as some occurrence of vermilion that we are using as a model (which could be a sense datum or some property-trope in the outside world, and can if necessary be arbitrarily changed to another like it) or any other occurrence that is precisely similar to this model. So, if I recognize what is currently being offered as an occurrence of vermilion, it may be because I realize that this occurrence is precisely similar to others that were previously given to me as being those of vermilion resorting to a model whose copy I have stored in my memory, which gives me an awareness of it as a color qualitatively identical to colors I have previously experienced. Thus, recalling the various experienced occurrences of vermilion {R1, R2... Rn} and the model-copy Rm that I have stored in my memory, I can say that {R1 = Rm, R2 = Rm... Rn = Rm} and, therefore, that {R1 = Rm = R2}, etc., without resorting to any Platonic entity or to some multiplication of identities of identities or even to the concept of an intensionally defined set – usual problems pointed out against particularistic strategies for handling universals.

What this view amounts to is that what we call the meaning of the word ‘vermilion’ must be identified with a referential connection, namely, with a rule that relates cognitive experiences of occurrences of a color to occurrences of color that we in some way use as models, in order to produce an awareness of what is being experienced as being the same vermilion color in each case. This internal semantic cognition, however, is produced in association with ‘vermilion’, a word for an entity. In this way, both a reference and its word turn out to be interpersonally accessible once the precise similarity between occurrences allows for interpersonal accessibility and an implicit agreement necessary to create a linguistic convention, even if the semantic cognition in itself, as a matter of fact, is not interpersonally accessible.[9] However, it should be pointed out that the semantic rule that uses memorized models to identify any new instance of vermilion is independent of this or that particular occurrence of vermilion – it just instantiates occurrences that satisfy it. This view is one I believe to be workable.

But this view has a price: we see on reflection that by adopting it we have already left behind us the referentialist conception of meaning. Even to set the meaning of a word as simple as ‘vermilion’ we need to appeal to something that is more than a rough object of reference and is independent of it, namely, a semantic rule.

Even if Russel’s semantic referentialism is unsustainable, there is a lesson to be learned from its discussion. Our last suggestion saves an important idea derived from his referentialism, namely, the idea that the existence of an object of reference is necessary for the names of objects taken as simple in the context of linguistic praxis. Even understanding the phrase ‘simple object’ in a sense that is not absolute, and restricting it to a non-decomposable entity in the framework of some linguistic practice, as could be the case with the sense datum of red or of red as a trope (a spatio-temporally singularized property that may be given to experience) the conclusion is that for such ‘simple names’ to acquire meaning they need to have reference.

This is why, in an important sense, a blind man cannot learn the meaning of the word ‘red’: since the color red is in a sense simple[10] and its knowledge demands acquaintance, and since he cannot have this sensory experience, he cannot construct the conventional criterial rule responsible for the shared referential meaning of the word. At least in the case of this subrogate of a logically proper name restricted to a certain linguistic practice, the existence of some object of reference is indispensable. But this, certainly, doesn’t mean that the word’s reference is its meaning. What it means is only that in some cases a given object of reference is indispensable for the formation of the semantic rule whereby some word acquires its referential function.

Meaning as a function of use

We shall now move on to a second candidate for the semantic link: use or application. This candidate was privileged by Wittgenstein, who suggested that the meaning of a linguistic expression is its use (Gebrauch) or application (Verwendung). As he wrote in a famous passage of his Philosophical Investigations:

You can, for a large class of cases of use of the word ‘meaning’ – if not for all cases of its use – explain it like this: the meaning of a word is its use in a language. (Wittgenstein 1984c, part I, sec 43).

This suggestion applies to both words and sentences. It applies clearly to what has been called the directive meaning: the illocutionary forces of expressions, which establish kinds of interaction between speaker and hearer in speech acts and can be made explicit by performative verbs. Together with expressive meaning, aiming to express internal psychological states, these two kinds of meaning aren’t really important for us here, since we are interested in kinds of meaning able to link our linguistic expressions with their references – referential meaning. Our concern here, as was clear from the start, is the content of declarative sentences, which is the kind of referential meaning we call descriptive, factual, cognitive, informative or epistemic meaning, able to link language with the world and to be endowed with truth-value (Aristotle called it apophantic speech). These epistemic and referential meanings should be of philosophical importance because, since by relating language and world, they should have epistemological and ontological implications.

However, the identification of meaning with use doesn’t apply so easily to the informative or epistemic meaning of our expressions. Consider, for example, a declarative sentence like ‘The tide is high’. It is easy to imagine an illocutionary use for this, for instance: warning, informing. But by doing this we would revert to meaning as force. According to the theory of speech acts, all utterances must have the form F(p), where F expresses (explicit or implicit) illocutionary force, and p expresses a propositional content. Here we are not interested in F, even if F expresses assertive (illocutionary) force; we are instead interested in the use of p as p. But it is not very natural to speak of a use of a statement separately from its assertive force. The only sure way of approaching pure referential and epistemic meaning with an appeal to use consists in producing an acceptable extension of the concept of use, suggesting that what is at issue in the case of epistemic meaning is the use involved in the act of communication by means of which a speaker intends to share with a hearer his awareness of a real or possible fact. Thus, when a speaker says ‘The tide is high’, in addition to using this sentence with the illocutionary force of affirmation, for example, the use may be the spelling in which a propositional content is expressed, normally added to assertive force and made to communicate both, with the intention to reproduce a corresponding judgment (the same propositional content plus judicative force) in hearers’ minds.

To make clear what is at stake, we can isolate epistemic meaning from assertive force, as when we employ the Fregean device of expressing a sentence’s content only as being regarded, depriving it of any kind of illocutionary force, as we can do with the sentence ‘The dog has run away’ in the question ‘Has the dog has run away?.’ The spelling of this sentence expressing epistemic content, even if unasserted – is also a use.

But what about the hearer’s understanding of a statement? The hearer is surely not using any spelling of words in his understanding of its meaning. In order to sustain the view that even in this case meaning is use, we need to resort here to a second and bolder extension of the word ‘use’. It seems in fact possible to say that we use referential expressions simply by thinking them. When a hearer thinks the tide is high, it is possible to say that he actually uses this sentence in an epistemic mode by thinking it, for if the hearer understands the sentence ‘The tide is high’ or ‘[Anne believes] that the dog has run away,’ with or without words he is repeating this judgment or its content to himself. In normal communication, this use that a hearer gives to heard words by understanding them should be identical to what a speaker has in mind when using words to convey epistemic meaning. Hence, not only the epistemic sense as the speaker’s thought, but also of the hearer’s thought, could be viewed as internalized epistemic uses, with or without the addition of assertive force, which in its internalized form is called a judicative force (what Frege would call Urteilskraft). Finally, if Plato was right that discursive thought is ‘a silent dialogue of the soul with itself’, we can generalize this process of internalization and consider any cognitive act associated with language as use, even without being associated with communicative action.[11] We can call this the epistemic use of an expression, of which assertive and judicative forces are dispensable complementary elements.

It is not difficult to question the relevance of the two proposed extensions of the meaning of the word ‘use’ that I am employing in order to save the view of meaning as a function of use: though they are not wrong or confusing, they can be considered too cumbersome to be considered necessary. However, as will become clear, the view of meaning as a function of use retains for Wittgenstein a pragmatic advantage, namely, that of locating meaning in its proper place from the start: the normal linguistic praxis – the concrete speech-act situation – even in the usual mental praxis of thinking with words. This enables us to individuate the meaning of an expression in its natural place, where it exercises its real function, enabling us to achieve in this way the highest level of interpersonal corrigibility, not excluding or distorting anything. And this is what Wittgenstein’s identification of meaning with use is all about: It allows us to individuate meanings precisely as they are, while in doing philosophy we are too often prone to exempt and distort meanings in order to produce illusory insights. In this sense the maxim that meaning is use can help us in practicing what Wittgenstein called philosophy as therapy, which aims to untie the knots of thought tied by philosophers, as far as it brings our words back from their metaphysical holidays to their daily work (Wittgenstein 1984f, part I, sec 116).

Meaning as a kind of rule

A more basic difficulty arises when we understand that the identification of meaning with use cannot be one of meaning and episodic use tout court, namely, a mere spatio-temporal occurrence (token) of a linguistic expression, as each occurrence differs from others in its spatio-temporal location. If it were the case, each new occurrence would have a new meaning, which would make the number of meanings of any linguistic expression unlimited sparking in this way a semantic catastrophe.

There is, however, an intuitive alternative. We can understand the words ‘use’ (Gebrauch) or ‘application’ (Verwendung) as a way of use (Gebrauchsweise) or a way of application (Verwendungsweise), since the same word can be used many times in the same way. But what is the way of use? Well, it doesn’t seem to be anything other than something-with-the-form-of-a-rule (etwas Regelartiges) that commands episodic uses. Wittgenstein himself came to that conclusion in an important, though less well known passage of his last work, On Certainty:

The meaning of a word is its mode of application (Art der Verwendung) ... Hence, there is a correspondence between the concepts of ‘meaning’ and ‘rule’. (Wittgenstein 1984a, sec. 61-62)

In fact, to use a word meaningfully is to use it in accordance with its mode or way of use or application, it is to use it correctly, and to use an expression correctly, in the right way, is to use it in accordance with those rules that give it its meaning. By analogy, we can say that we use a screwdriver according to its way of use when we use it correctly, according to a rule, for instance, turning it clockwise in order to tighten a screw. Consider the following two examples of ways of use that I take from the linguee dictionary:

WAY OF USE: Apply several times to the skin and rub in for several minutes with a circular motion, until completely absorbed.

WAY OF USE: To color and cover up grey hair, we recommend 20 ml. 6% of a cream oxidizing agent in the proportion of 1 + 1.

Of course, what the ‘way of use’ presents is a rule or sequence or combination of rules for correct use of material. Now we see clearly that meaning can only be identified with use in the sense of something-of-the-kind-of-a-rule determinimg episodic uses. And what holds in general for word’s use also holds here for epistemic or referential use. In fact, the identification between meaningfulness and rule is more primitive. Consider the following two signs: ‘0O0’ and ‘Oà’. The second seems to us more meaningful, since we have the habit to link it with a rule pointing to a direction. Rules are the intrinsic source of meaningfulness.

Meaning as combinations of rules

However, why does Wittgenstein prefer to say that meaning is determined by rules? Why can’t the meaning of our linguistic expressions be identified with rules simpliciter? In my view, at least part of the answer was also approached by him with his analogy between language and calculation. (Wittgenstein 1984f, p. 168; also Wittgenstein 1982, pp. 96-97). This understanding is reinforced by the many otherwise unjustified considerations in his Brown Book of how complex sequences of rules could be followed in relatively simple language games. In use, linguistic expressions normally involve calculations, which should be understood as nothing more than combinations of rules or conventions. And the meanings that these expressions have can consist essentially in the combinations of more or less implicit and automatized conventions whose knowledge is tacitly shared among speakers.

Arithmetic can serve as an illustration here. If the meaning of a mathematical proposition is constituted by its proof, and the proof is a combination of rules, this meaning is also a combination of rules. Some people can do the multiplication ‘120 x 30 = 3,600’, for instance, by combining three rules, first multiplying 100 by 30, then multiplying 30 by 20, and finally adding 3,000 and 600 to get the result 3,600. The meaning, understood as the cognitive content of multiplying ‘120 x 30 = 3,600’, would be given by this and other methods of calculation, which together would amount to essentially the same general meaning, insofar as they proceed in different but complementary ways, i.e., proceeding from the same starting point and reaching the same result, in some cases by direct application of a single rule.

We see that what we called something-of-the-kind-of-a-rule can be understood as possible combinations of rules that bring us to a certain result. The meaning of a linguistic expression must also be the same as (i) a specific rule or (ii) one or more combinations of rules that determine a correct episodic use of the rules – which could be called a rule-complex (Regelkomplex). And the epistemic meaning of a linguistic expression is a rule or possible rule-complex that when applied or satisfied brings about a cognition of some state of affairs. (In this book, I will use ‘rule’ in a broad sense, including combinations of rules or rule-complexes. This is ultimately a justified extension of the term ‘rule’, since combinations of rules that produce the same results have the same functions as rules. Although non-reducible to shared conventions, such compositions of rules can still be seen as conventionally grounded, since they are constituted by elementary rules, namely, those usually established by convention. When someone says, as Wittgenstein sometimes does, that meaning is determined by rules, what can be reasonably meant by this is that epistemic meaning is the application of some rule-complex enabling us to reach some cognitive result, and nothing more.)

Since we are interested in the problem of reference, the meaning that will be considered will almost always be epistemic or referential, that is, concerning semantic-cognitive rules responsible for our linguistic awareness of what can be objectively given, which are also criterial rules. So, we are dealing with cognitive criterial rules responsible for the epistemic or referential significance of declarative sentences. Criteria are, in Wittgenstein’s terms, ‘what confers to our words their ordinary meanings,’ (Wittgenstein 1958, p.57). For him these semantic-cognitive rules are based on criteria, which are in a sense conditions that must be independently given in order to make us realize that something is the case. Using Wittgenstein’s own example, if someone says ‘It’s raining’ and this statement is true, this involves applying a criterial rule, a rule which requires that certain conditions must be given – say, drops of water falling from the sky – so that a cognition follows – say, awareness that it’s raining. And this resulting awareness, the cognition, could be understood, as we have already suggested at the end of the first chapter, as the availability to the system of what results from what we may consider as (effectively or only possibly) satisfied criterial conditions.

However, if an analysis of the appeal to use leads us to cognitive reference-rules, why appeal to use? Why not just start with an investigation of these rules and their combinations? The answer was already given. Language is an instrument of action, and meaning is there to enable action. Attention to correct use helps us to individuate meaning and to find the real cognitive-criterial rules or combinations of rules that must unavoidably be applied in order to confer meaning.

We can further elucidate this by appealing to a metaphor: when a post office sends a letter it has general indications as to the addressee’s state, city, and locality. These general indications are somewhat like the grammatical meaning of a sentence. Although necessary, they are not sufficient, since too many other sentences have the same grammatical meaning, since too many other addressees live in the same state and city. To reduce this vagueness, postmen also need the name of the street, the building number and the addressee’s apartment number. Without these singularizing details, it will be impossible to deliver mail to its proper destination. The same holds for cognitive meaning. What is decisive is the way of applying our expressions in the given context – not only the concrete, but also the whole discursive context, such as we find in philosophical texts. What an appeal to use does is to lead us to semantic details necessary to find what really matters. In other words: the more general traits of an expression’s way of use are less relevant, since they are common to many other expressions and for this reason are not able to individuate meanings. What matters at most are the more specific traits of meaning: ways of use. These are traits that expressions can only gain in contexts of application. Consequently, these can only be completely explored in linguistic praxis. For this reason, it is so important to consider occasions of use. These can be responsible for subtle semantic variations that an expression can have in different concrete or discursive contexts. As we will see, such subtle semantic distinctions are of particular importance for correcting or criticizing language, since they allow us to correct misconceptions arising from philosophical attempts to use words beyond the limits of meaningful language, particularly those belonging to the metaphysics of reference, which in most cases does it by systematically confusing semantic elements selected from the linguistic praxis.

Meanings and language games

There is more to be said about meaning as a function of use. The first thing to be noted is that a linguistic expression only makes sense when used within a system of rules. Here again we may appeal to a metaphor. We can compare a linguistic expression with a chess piece, and its use with a move in playing chess. When you move a chess piece, the meaning of the move is not given only by the rule that governs the piece’s move. What the move really means in the important sense of the word will depend on the game situation. It will be given by the contextually determined tactic, by the calculation of possible combinations of rules in anticipation of possible moves by the opponent and responses that could be made. This is a calculation made in playing chess and could be different in a different game, even if the pieces were the same.

Something not very dissimilar occurs with linguistic use. The linguistic rules governing what Wittgenstein called ‘superficial grammar’ could be compared with the rules for moving chess pieces. But grammatical rules – even those of some logical grammar – may not be what really matters. What matters more often are rules or rule-complexes belonging to what Wittgenstein called ‘deep grammar’ (1983c I, sec. 668), which may have more resemblance to combinations of rules that justify moves according to chess players’ tactical calculations, particularly when we consider what takes place in a dialogic speech.

As an example, one knows that the sentence ‘Caesar visited Calpurnia’ is grammatically correct, and one may even know that its logical form is aRb. But this will be of no help if one does not know who Caesar and Calpurnia were, what relationship they had, and cannot say when or why he could have visited her. Superficial grammar (or syntax) gives expression to a grammatical sense that is often the same for semantically different sentences. The rules and combinations of rules that constitute what is meant by a linguistic expression are more flexible, changing in accordance with the concrete and linguistic context.

Furthermore, in the same way that the rules responsible for a strategic move in chess depend on a context provided by the system of rules that constitutes the game of chess, the rules determining the application of linguistic expressions are able to produce meaningful utterances only when combined within a system of rules, called by Wittgenstein a language game or a linguistic practice. Language games can be characterized as linguistic systems that typically include syntactic, semantic and pragmatic rules. Examples of language games given by Wittgenstein are:

Giving orders and acting according to them, describing an object by its appearance or measures, informing… speculating about an event, making and testing hypotheses… making up a story, reading… solving a riddle, telling a joke, describing a landscape, acting, asking, thanking, cursing, greeting, praying, etc. (Wittgenstein 1984c, sec, 23)

But he also uses the same idea in a wider sense, pointing to more extended domains of language like:

The language of colors, the language of proper names, or even the important ‘knowing games’ from On Certainty, like the game of doubt and the languages of history, physics, chemistry and arithmetic. (Costa 1990, p. 50)

That is: it seems that almost any chunk of our language can be seen as a language game. Language games include themselves, one within another, like the case of Cantor’s theory of infinite numbers within the language of mathematics, and they can partially overlap one another, as when someone describes a landscape and by this means also tells a joke, insofar as we remain able to distinguish them (Wittgenstein 1984c, sec. 46-48). Fundamental is only that they remain interpersonally distinguishable.

The concept of language game or linguistic practice contains the concept of the speech act, systematically studied by J. L. Austin and John Searle, but it is much wider. This is why Wittgenstein was not mistaken as he said there are countless language games (1983c sec. 23).

By making the meanings of expressions the results of combinations of rules belonging to rule systems typified by language games, Wittgenstein was endorsing what was later called semantic molecularism: what we call the meaning of an expression does not depend on the expression in isolation (semantic atomism), nor on its insertion in language as a whole (semantic holism), but essentially on the context of linguistic practice in which it is located (molecular subsystem of language).

Finally, it is a mistake to believe that meaning is a matter of all or nothing. It is plausible to think that some part of a word’s meaning when used according to the rules of a language game extends to the group of games to which this game belongs, gradually merging with them.

In support of the idea that we use and give meaning to our expressions in language games, in the Brown Book Wittgenstein described natural language as a great nebula of language games, and later in the Philosophical Investigations, he compared it to a great old city:

The language of the adult presents itself to our eyes as a massive nebula, ordinary language, surrounded by more or less defined language games, which are technical languages. (Wittgenstein 1984e, p. 122)

Our language can be seen as an ancient city: a maze of little streets and squares, of old and new houses, and of houses with additions from various periods; and this surrounded by a multitude of new boroughs, with straight, regular streets and uniform houses. (Wittgenstein 1984c, sec. 18)

The nebula, the city, begins with what was built in its original center: the practices of ordinary language, expressing our humble commonsense wisdom. To this there come new insights, like those arising with the emergence of new scientific fields. As with games, the great old city can be subdivided in many distinct ways, one part including another, or one overlapping another.

There is a noteworthy relation of dependence here: learning and teaching these new practices, even the possibility of their understanding and creation, depends on prior acquisition of more basic practices governing ordinary life. This coheres with our principle of the primacy of modest common sense: Rejection of its proper assumptions by means of science is a questionable matter, and it is logically incoherent to reject them as a whole based on science.

A question that now arises is: in such circumstances, what criteria would we have for identifying meaning variations, or, in other words, what criteria would we have for identifying the language game in which an expression is used or even misused? Considering that language can be subdivided in multiple and varied ways, it seems that we can apply different criteria to the same move, insofar as we are able to interpersonally identify and share the criteria we are applying... But in this case, what guides us in choosing a criterion? Is this identification really possible?

My suggestion is that the identification of the language game under whose criteria a word is being used involves (i) the relevant factual and linguistic context in which the word is used, together with (ii) the speaker’s intention in using the word, insofar as this intention can be made interpersonally clear, even if only in a tacit way. It seems that these two factors allow the identification of the language game in which a speaker is using a linguistic expression as follows: if a speaker succeeds in giving a clear idea of the context and aim he has in using an expression, he is identifying the system of linguistic rules, the relevant language game for determining how he is using the expression, that is, the intended rules constitutive of its meaning. And if a hearer correctly identifies the speaker’s context and aims, he identifies the language game the speaker has in mind and will be able to understand correctly what the speaker means. (For example, if I tell my students that Aristotle said friendship is only possible among equals, the context shows everyone that I am playing a game of naming in which I intend to lecture on the famous Greek philosopher and not about some homonym.)

Meaning and form of life

There is a last important concept in the understanding of Wittgenstein’s explanation of meaning. The linguistic practices that form the nebula find their ultimate raison d’être as constituents of what Wittgenstein called a form of life (Lebensform). As he wrote:

…the word ‘language game’ is used here to emphasize the fact that speaking a language is part of an activity, or of a form of life. (Wittgenstein 1984c, sec. 23)

Right or false is what human beings say; and in the language they agree on. This is no agreement in opinions, but in form of life. (Wittgenstein 1984c, sec. 241)

What is taken for granted, the given, we could say, are forms of life (Wittgenstein 1984, II, p. 572)

In arriving at this idea Wittgenstein was probably influenced by an article by the great anthropologist Bronislaw Malinowski, who suggested that in order to learn the language of a primitive people one needs to share life with them in their society (Malinowski 1989). One example used by Malinowski to illustrate this point can be useful here: when fishermen in the Trobriand Islands use the phrase ‘paddling in a place’, they mean they are navigating close to an island village, and the waters around the islands are so deep that it is not possible to use a pole to propel the canoe, so they need paddle their boats to reach the village. Only by knowing speakers’ life circumstances can we know the information needed to understand what their expressions mean.

The relevance of much that Wittgenstein wrote consists in his having seen the importance and comprehensiveness of some ideas. For him the phrase ‘form of life’ means the way of life in a society. More precisely: the whole complex of regularities that govern the lives of people in a social environment considered in its totality.

We can compare the idea of a form of life with what is involved in two technical terms introduced by J. R. Searle, the network of meanings involved in the determination of an intention, which is linked with the background of abilities, skills, dispositions and ways of doing things (Searle 1983, ch. 5). Though including what Searle means by network and background, the concept of life form is more comprehensive, since even the landscape in which natives were living should be apprehended by the concept and may have an influence on the meaning.

More auspicious may be the comparison between the concept of form of life and Husserl’s concept of life-world (Lebenswelt), which for the latter author is the whole of our shared communal world of human activity (Husserl 1954, Vol. VI, p. 105 f.). For Husserl, the life-world, which can be subdivided in different life-worlds or Home-worlds (Heimwelten), builds the holistic framework within which all other knowledge is acquired, serving therefore as the ultimate foundation of all human cultural endeavors, gradually extending in the scientific ones. Furthermore, although there are different life-worlds, they must have basic common aspects (like spatio-temporality, birth, death, instincts, hunger, thirst, etc.).

Wittgenstein would probably share this view at least in its non-theoretical aspects. It is helpful to see that there must be something common in the most basic levels of our different forms of life. For this communality should be what allows us to be transplanted into a different form of life and nonetheless able to learn its language, once we all share a sufficiently similar human nature.

Tying the threads together

We can now summarize. Language appears in Wittgenstein’s philosophy as an immensely complex system of syntactic, semantic and pragmatic rules: a system that we can subdivide in many ways into subsystems that are called languages, linguistic practices or language games, which are in turn rooted in a wider system, the life form, which is made up of regularities that determine the lives of people in a society. Linguistic practices constituting our ordinary language originate spontaneously from our way of life and depend on it. Here again, we see that creating and learning the specialized language games of science is only possible because it assumes some central practices of ordinary language that are ultimately dependent on life forms. This is also why a computer will never be able to give meaning to the signs with which it operates: a silicon-based machine is a by-product of a life form and not a biological agent naturally growing within it.

We can synthesize the considerations made so far in the following formula:

A meaning of an expression x = any episodic use of x made in accordance with the rules of a proper linguistic practice (the language game) rooted in a proper life form.[12]

This is a characterization of meaning as something that belongs to the praxis of language as it is understood and to our extensions of the concept of use as what is meant in human mental acts. This assimilation of epistemic meaning to action by means of the extended notion of use as a rule-in-its-application is what makes it unnecessary to hypostasize meaning-rules as abstract objects. Meaning is what we think of or speak about as being meaningful; and what we think or speak is meaningful insofar as it is correctly used, namely, used in accordance with the meaning-rules of linguistic practice rooted in our life form.

With this, I believe that we have achieved, based on Wittgenstein’s views, a plausible and minimally distorted surveillable representation of the grammar of the concept of meaning. This representation is particularly important, because it plays a role as a semantic foundation for philosophy as critic of language or therapy.

This is also why a surveillable representation of the grammar of meaning is central to Wittgenstein’s later thought: it is the sustaining core of his philosophy, as much as the doctrine of ideas was the sustaining core of Plato’s philosophy.

Criteria and symptoms again

Another important distinction that we owe to Wittgenstein, already introduced in the first chapter of this book, is the distinction between criteria and symptoms. Semantic-cognitive rules can be seen as criterial rules. Criterial rules are ones based on conditions called criteria. As we also have noted, words like ‘criteria’, ‘symptoms’ and ‘conditions’, have a process-product ambiguity. Usually they mean the external conditions that, once really given, make possible the application of a semantic-cognitive rule. But they can also mean the internal conditions constitutive of the semantic-cognitive rule (criterial rule) that we are able to consider when we suppose or imagine its application.

There is, as we also have noted, a fundamental difference between criteria and symptoms. Criteria are conditions that by convention, once accepted as really given, warrant for us the application of a semantic-cognitive rule; symptoms, on the other hand, are conditions that by convention once accepted as really given make the application of a semantic-cognitive rule only more or less probable. Criteria should establish the necessary and sufficient conditions to the application of an expression. Because of this Wittgenstein also called them definitional criteria; they are primary criteria, while symptoms are also called secondary criteria (Wittgenstein (ed.) 2001, p. 28).

One example makes the distinction clear: a criterion for the application of the predicate word ‘malaria’ is actually finding a bacterium – plasmodium falciparum – in a patient’s blood. If we accept that we have found this, by definition we are warranted in saying that the patient has malaria. But if all we find is that the person has cyclically high fever, we have only a symptom of malaria, something that only makes it probable that the patient has contracted this disease.

Insofar as criteria are understood as internal constitutive conditions of the semantic-cognitive rules for the referential use of a concept-word, they must belong to its meaning, since these rules (whether effectively applied or only cognitively regarded) are constitutive of meaning. As Wittgenstein wrote that the criteria ‘give to the words their common meanings’ (Wittgenstein 1986, p. 57) he should have in mind criterial rules.

Finally, criteria have the role of criteria only in the context of the language games to which they belong. This is the main reason why Wittgenstein says that there can be a grammatical oscillation between criteria and symptoms: with changes in linguistic practice, criteria can become symptoms and vice versa (Wittgensteien 1983c, sec. 79, 354). That is: the same condition that works as a criterion in one practice can serve only as a symptom in another practice and vice versa. And similar changes can also occur as a result of the evolution of language, which may change our conventions, often turning criteria into symptoms and replacing them with new conditions.

The distinction between criteria and symptoms is also important for the critique of language. Philosophers are all too often inclined to confuse criteria with symptoms. To give a very simple example, consider peoples’ facial features and bodies. These are the physical characteristics by means of which we immediately identify people we know. At first sight, it seems that they are the real criteria for the identification of persons. But obviously they aren’t. If a person, as happens in fairy tales, were transformed into a donkey, but continued to behave no differently than before, conversing with us and in full possession of her memories and abilities, we would still cling to the idea that she remains the same person, even though in a different body. This and other similar thought-experiments show that people’s bodily appearances are not primary criteria at all, but only useful symptoms that make their identification very probable. To find the ultimate criteria of personal identity is still today a controversial philosophical problem.[13]

Transgressions of the internal limits of language

In the Tractatus, Wittgenstein was interested in ascertaining what David Pears called the external limits of language and its transgressions. This is relatively easy to spot: a logical contradiction is an external transgression. However, he came to see that most philosophical confusions are caused by the much more subtle transgressions of the internal limits of language. These transgressions happen because our expressions can be used in different linguistic practices, gaining in this way more or less subtle changes in meaning. ‘The place of a word in grammar is its meaning’,[14] he wrote, but this place is changeable and cannot be fixed in advance. Now, when an expression is used simultaneously in different practices, where it should receive a different meaning or meaning-nuance, it turns out to be easier to confuse what we mean with it.

In Wittgenstein’s philosophy we can find two forms of confusion or misleading uses of expressions, which we may call equivocity and hypostasis.[15]

These two forms of transgression have a striking similarity to the psychoanalytic distinction between the two mechanisms of the primary process (primäre Vorgang), called by Sigmund Freud deslocation (Verschiebung) and condensation (Verdichtung). Hence, it is interesting to explain this process here very briefly. According to Freud, psychic life can follow two distinct processes: the secondary process (sekundäre Vorgang) and the primary process (primäre Vorgang). The secondary process is the typically conscious rational thought, like the scientific thought. In this process, affective or emotional charges (Besetzungen) are firmly associated with their respective representations (Vorstellungen). The primary process, on the other hand, is found in dreams, neurotic symptoms, humor, artistic creations, religion, and… philosophy. In these cases, the emotional charges are not rigidly associated with their respective representations (or thoughts) and can be transferred to different representations, insofar as the latter can be associated with the former representations. The primary process is what produces the conscious manifestation of unconscious or pre-conscious thoughts, in the latter case defined as non-repressed and consequently always able to become conscious.

The two fundamental mechanisms of the primary process are displacement and condensation, and they are more clearly understood in Freud’s explanation of how dreams are produced (Freud 1900, ch. 7).

Displacement occurs when the emotional charge of a repressed representation is transferred to another representation, which is able to elude censorship and become conscious, thereby releasing its endo-psychic tension into consciousness. We can say that representation R1, unable to become conscious, has its charge transferred to representation R2, able to deceive censorship and become conscious. A Freudian example of displacement is the story of a Jewish woman who couldn’t marry the man she loved because he was a Christian. However, she dreamed that she gave him her comb. This is her conscious representation in the dream; but the unconscious, repressed representation is the idea of giving herself to him in love. The emotional charge passes from the repressed representation to the innocuous one, which makes it possible for the charge to be released in the dreamer’s consciousness, diminishing the endo-psychic tension.

The mechanism of condensation is somewhat different. Here a representation or group of interrelated representations transfers their affective charges to a partial representation belonging to them, which becomes liberated in consciousness. We can represent this by saying that the charges of {R1, R2… Rn} are usually condensed in one of the representations, say, R2, which enters into consciousness, in this way allowing the release of charges. One example of condensation would be a case in which the woman dreams that the man she loves forgot his scarf in her home... The scarf is part of the whole representation of the man, and the emotional charge associated with the whole is condensed in this partial representation and released into consciousness. It is worth remembering that according to Freud displacement requires unconsciousness by being a product of repression, while condensation requires only pre-consciousness, since it isn’t necessarily a product of repression.

Now, investigation of the two mechanisms by which the internal limits of language are transgressed brings into sharper focus the sometimes noted relation between philosophy as therapy and psychoanalysis (Wisdom 1953), for it shows that philosophical activity is affected not only by a lack of semantic awareness, but also by affirmative unconscious motivations.

Let’s see now how the primary process works in cases of confusion arising from linguistic transgressions of normal uses of expressions. By using an expression equivocally, a philosopher shifts the use of this expression, applying it in the context of a linguistic practice B, though following the semantic rules that this expression should have in linguistic practice A. This equivocity amounts to displacement, since the charges associated with the first use are transferred to a new representation. On the other hand – in what we call hypostasis – the philosopher tries to apply an expression that can be used in two or more linguistic practices, say, A, B and C simultaneously in a certain context, as if there were a single linguistic practice able to join these different uses, adding their emotional charges, when in fact this practice does not exist.

Philosophical examples of these mechanisms can be complicated and difficult to describe, since philosophers, being masters of deception (and self-deception) build their spider webs with far more abstract and complex material than dreamers. Hence, I will consider only two very simple cases. For the case of displacement, consider the following skeptical paradox attributed to the Megarian philosopher Stilpo, denying the possibility of predication. For Stilpo, if I say that Socrates is wise, this is a contradiction, because I am denying that Socrates is Socrates. That is: we can say of something that it is what it is, but if we want to say something more than this, we fall into a contradiction, for we are denying that it is what it is… All that we can do is to express the principle of identity or remain silent.

We can explain Stilpo’s fallacy as due to a failure to distinguish the ‘is’ of predication from the ‘is’ of identity. We can distinguish linguistic practices (contexts) of type A, in which the verb ‘to be’ works as a copula, introducing the predicate (e.g., ‘Socrates is wise’) from linguistic practices of type B, in which the verb ‘to be’ is used in the sense of identity (e.g., ‘Socrates is Socrates’). However, Stilpo uses the verb ‘to be’ as having only one correct use: that which is found in practices of type B. As a result, each time he observes people using the verb ‘to be’ in practice A, he understands their use as following the rule of use that the verb has in practice B – meaning ‘is the same as’ – in this way equivocally ans systematically displacing the real use from practice A to practice B. Since he realizes that the way of use of practice B is contradictory in the context of A, he falsely concludes that predication is impossible.

We will now offer an example of hypostasis in philosophy. Consider the suggestion of a philosopher according to whom the verb ‘to be’ must have a truly originary sense, which is not only that of copula, but also of identity and of existence. To justify this, our philosopher considers the sentence: ‘To be is to be’ (Sein ist Sein). This sentence says not only that ‘to be’ has the property of being, but also that ‘to be’ is the same as ‘to be’, and finally that ‘to be’ has the property of be in existence (something exists). Against this follie metaphysique, a criticist of language will tell us that it is much more plausible to think that what the philosopher seeks with the ‘is’ of the sentence ‘To be is to be’, although grammatically correct, is semantically only an incoherent mixture of different senses of the verb ‘to be’, which were created for different practical purposes. It is a hypostasis: a condensation mixing three different modes of use or meanings of the same word from three truly distinct practices: the predicative practice A, the identifying practice B, and the practice of attributing existence C. In the best case, this is multiple ambiguity; but since the philosopher is claiming to have discovered a way to achieve originary meaning, we diagnose incoherence and illusion.

I give these explanations because in criticizing the metaphysics of reference, we very often denounce equivocity and hypostasis. Wittgenstein has suggested that philosophical maladies have their origins in a ‘craving for generality’, in efforts to achieve generalization in the wrong way, very often reductively influenced by the greater success of natural science (Wittgenstein 1966, p. 1956). We can now suggest that at least in the frequent case of equivocity it may work as a compensatory byproduct of repressing some kind of undesirable knowledge.

An additional point is that striving for generalization is inherent in the philosophical endeavor, even if ultimately doomed to fail. This is, I think, the reason Wittgenstein’s concession that the philosophical bumps up against the walls of language has the mark of profundity (Wittgenstein 1984c, sec. 111). The reason for this is that these confusions, when effective, have the potential to point to relevant issues after forcing us to search for the right way out of the illusions they produce in us. As I intend to show, much of the metaphysics of reference is grounded upon the forms of confusion described above, particularly equivocation (displacement), and can easily be the object of a critique of language.

The form of the semantic-cognitive rules

In an approximate way, we can now expose the general form of a cognitive or criterial semantic rule (basically the identification rule of singular terms, the atributive rule of general terms, and the verificational rule of statements) as being constituted, on one hand by a relation that can be summarized in the sign ‘~>’, which means either an inductive inference (for empirical knowledge) or a deductive inference (for logico-conceptual knowledge). By ‘C’ we mean the criteria assumed as satisfied and, on another hand, by the result ‘R’, the meaning-awareness. Here is a simplified schema:

C ~> R

This schema is very simplified, for the criteria can be multiple, varied and staggered in a procedure. The satisfaction (always in the context of some practice) of a (definitory) criterium under adequate circumstances give place to the occurrence of a meaning-awareness R, which in the case of a statement could be expressed by a sentence. The epistemic content or meaning or sense is the whole procedure of rule-following, including still unverified criteria as far as they also belong to the rule.

Complementing what was said, there is a second cognitive element out of the semantic-cognitive rule, which is our awareness of the consequences of the content – of the rule (or rule-complex), This is what I suggest to be explainable by theories of consciousness: it is what has been called the ‘availability of content to reasoning and action’, the ‘transmission of content for the mind’s global workspace’, the ‘brain celebrity’, etc.

Calling the meaning or epistemic content [{C1 ˅ C2 ˅… ˅ Cn} ~> R] and calling A its cognitive effect (awareness or transmission of content to the system…), we can summarize the usual form of a semantic-cognitive rule added with its effects as follows:

[{C1 ˅ C2 ˅… ˅ Cn} ~> R] ~> A

In order to better understand the rule, suppose that C1 and C2 are assumed criteria for the epistemic result R expressed by the statement ‘Cesar visited Calpurnia.’ The conclusion that Cesar visited Calpurnia is an occurrence or use that can be spelt out or only silently thought.[16] But when we take this R in isolation from any criterion for identifying Cesar or Calpurnia, what we get is only grammatical meaning. We can add informative content to R only when it is at least potentially associated with some procedure from which it results, for instance [{C1} ~> R] when C1 is accepted as satisfied. Finally, it seems inevitable that this procedure of rule-following produces A, that is, an awareness, which would be nothing other than its availability for reasoning and action, its transmission to the mind’s global workspace, etc. given by some theory of consciousness.

It is interesting to note the proximity between our conclusion and inferentialist approaches to meaning. If we say that a content, a semantic-cognitive rule is available for reasoning and action, we also mean that the content (which is in itself inferential) would be inferentially open to any related content. This is what I believe can be understood as the cognitive effect of the satisfaction of the semantic-cognitive rule. I will avoid to risk mixing this inferential openness proper of cognitive awareness of content with the real meaning because inferential openness is a consequence of the instantiation of referential or epistemic meaning, which is won through the application of its often implicit semantic-cognitive inferential rules.

Finally, a semantic-cognitive rule can be (a) only regarded or conceived in its application, only imaginatively applied to some extent, or (b) effectively applied in its domain. This can be explained:

Concerning (a): propositions or thoughts merely regarded in the absence of their judication, the attribution of truth-value. This is a rehearsal of the true application. Here the epistemic meaning, say, the verification rule, can only be regarded or conceived in a Fregean way. But this does not mean that propositions or thoughts are statically regarded as abstract entities; what is meant is that they seem able to be applied in our imagination, even if only in a limited way, with the result that we will be aware of the semantic-cognitive content as an occurence, of the rule-in-its-applicability, even if we may not be reflexively aware of its internal structure, since we are using it as an instrument in the search for the consequences of its satisfaction.

Concerning (b): here semantic-cognitive rules are effectively (and not only in researsal) applied in a choosen domain. In this case, if as a result we spell a sentence internally, adding to it a judicative value and we associate this cognitive application of the rule with its spelling, we ultimately have an assertion: a sentence whose content is given as true. Notice that what is judged or asserted is the whole content: the verification rule with the satisfaction of its criteria.

The purpose of these distinctions isn’t clear now, but I hope that they will gradually justify themselves.

What is wrong with the private language argument?

The so-called private language argument is open to a variety of interpretations. Probably the most consequential one suggests that it presents a strong challenge to the possibility of learning any phenomenalist language understood as a language based on internal identification of phenomenal mental states, such as sensations and emotions, that is, as this language is currently understood.

We can begin with the contrasting case: public physicalist language. How do we learn to identify and distinguish different types of physical objects? For example: how does a child learn to identify references of the word ‘ball’? This doesn’t happen by means of verbal definitions, but by ostension: adults point to examples and say things like, ‘This is a ball’ or ‘That isn’t a ball’... and the child eventually learns what types of objects are round balls. But this learning is only confirmed when a new ball is presented and the child shows adults that it is able to re-identify the object as belonging to the type ball. In this case, based on agreement among other speakers of the language regarding correct re-identification, it is possible for adults and even for a child to know they have learned a rule for identifying ball-type objects. That is, we ultimately know that we have learned a rule after our way of application is confirmed by interpersonal checking.

Consider now what happens when we try to identify internal mental entities of a phenomenal character. In this case, we cannot do any checking of interpersonal re-identifications. Suppose that a person should learn to identify an internal state, for example, feelings of pain. Other people cannot teach her to do this, because they cannot know when she feels pain or how it feels. But let’s suppose that independently of any public language a person is able to point inwardly to some feeling of pain and identify her feeling through a sign that she herself has invented. Suppose this sign is ‘P’. Imagine now that the next time she feels pain, she says to herself ‘P’, intending to point to the same internal mental state. In this case she won’t be able to know if she is really pointing to the same phenomenal state that she initially pointed to, because there are no other speakers who can check the correctness of her rule application, i.e., who are able to confirm or refute her identification. As Wittgenstein notes:

Intersubjective criteria of correction are missing here, and where such criteria do not exist, we cannot distinguish between following a rule and the mere impression of following a rule (1983c, sec. 258-9).

However, this distinction is indispensable because without it we have no way to construct something that we may effectively call ‘a rule’.

Since language is a system of rules, the generalization of this result leads us to the radical conclusion that there cannot be a language whose objects of reference are internal phenomenal states.[17] The only construable language is one based on behavioral expressions of internal states. Wittgenstein concedes the existence of these mental states, rejecting behaviorism, in my view incoherently, since if this is the case, mental states should be beyond the reach of languistic rules and therefore cognitively unspeakable, whereas according to him ‘a nothing is as good as a something about which nothing can be said’ (Wittgenstein 1984, sec. 293). P. F. Strawson, commenting on Wittgenstein’s view, suggested that the latter suffered from an anti-subjectivist prejudice.

The problem, as Ernst Tugendhat once said to me, is that the private language argument is too counterintuitive to be correct. The point, however, is to discover where the weakness of the argument lies. In order to find this, we need to make two things clear. The first is that a rule will only cease to be a rule if its correction is logically impossible. A rule does not cease to be a rule just because for some contingent reason it was not in fact interpersonally checked. After all, it is an indisputable fact that many of the rules we follow, for one reason or another, have never been interpersonally checked. I can invent for myself the rule of never eating creamed spinach and nobody needs to be informed of this. There are rules that for merely circumstantial reasons cannot be checked, such as those made by a shipwrecked sailor who lived and eventually died alone on a remote uninhabited island.

The objection that could be made to this interpretation is that Wittgenstein’s argument demands that any rule, in order to be a rule, must be publicly checked for correctness, and not only able to be publicly corrected (correctable). Even if this interpretation were true, it would be uninteresting. For it expresses only an absurdly implausible and methodologically anti-Wittgensteinian idea: it would compromise our commonsense certainty that we are able to follow rules that have not yet been checked by others. In fact, overstating scepticism, it would also be possible to argue that no rule can be applied in situations where it cannot be subjected to simultaneous intersubjective correction – after all, there is no guarantee that in the absence of this control the rule will be correctly interpreted and applied... However, this gratuitous skepticism is too implausible to affect us.

With this in mind, let us now interpret Wittgenstein’s argument as assuming that the rules of a phenomenal language must be logically incorrigible. Let’s suppose that every morning when waking up I involuntarily follow the rule to remind myself of the first sentence of Dante’s Divine Comedy, but that I always immediately forget doing this. Here we are already close to nonsense, and we would be there if it could be proved to be logically impossible to know if this happens...

We conclude that it is the assumed logical incorrigibility of phenomenal language that gives Wittgenstein’s argument its plausibility: it seems well acceptable to think that a rule whose correction is logically impossible cannot be considered a rule. If the rules of our (supposedly) private phenomenal language are logically incorrigible, it seems that they cannot, ultimately, be distinguished from mere impressions of rules.

This reconstruction of Wittgenstein’s argument is not only the most interesting and reasonable, it also uncovers what I believe to be an important implicit assumption made by him. He noted, for instance, that even though person A’s nervous system could be linked to that of person B, so that A could feel a wasp stinging B’s hand, only the location of pain would be shared, but not the pain itself, because pain felt by A would be A’s pain, while pain felt by B would still be B’s pain (Wittgenstein 1986, p. 54). In his most famous article, surely read by Wittgenstein, Frege noted that if another person could enter our minds to observe a visual representation, the representation she experienced would be her own and not ours (Frege 1892, p. 30). Such considerations lead us to a dogma generally assumed by earlier Twentieth Century analytical philosophers: the thesis that phenomenal states are logically unsharable.[18] If this thesis is correct, then interpersonal corrigibility of phenomenal language would be logically impossible, what would support the private language argument.

At this point, all we need to destroy the private language argument’s foundation is to show that the logical unsharability of phenomenal states is a false principle. That is, we need to show that although the rules of a phenomenal language have never been intersubjectively corrected, they are – contrary to what Wittgenstein and many philosophers assumed – logically corrigible from an interpersonal perspective, this being the hidden flaw that tacitly supports the private language argument.

It’s hard to imagine a thought-experiment showing that phenomenal states are logically shareable. We can begin by making an analogy with computers. Suppose A and B are updated versions of the primitive kind of automata called by Grey Walter machina speculatrix, which fed on light and spent their lives in search of it. Suppose automaton A meets automaton B, and that A is able to read the information content that B has accumulated in its searching. Although automaton A can copy these data first, and only afterwards read them in his own system, so that such ‘contents of experience’ become an unshared part of itself, there is no contradiction in thinking that A can read these ‘contents’ directly in B, as if they were its own, thereby sharing them with automaton B! This would in fact be the simplest and most direct method. Why should we think that in a similar situation we humans would need to be different from machines?

Perhaps it is even possible to imagine that someday there will be two human beings, A* and B*, who somehow share some functioning of parts of their brains. Suppose that their limbic system is essentially the same, while the neocortical regions of A* and B* remain distinct. Now, it seems conceivable that a mental state of pain that occurs in relevant parts of the same limbic system could be shared by subjects A* and B*, even though their conscious interpretation of pain, made in their distinct neocortical regions, are quantitatively different. If we understand pain essentially as a process occurring in a limbic system, then A* and B* really could share the same pain, demonstrating possible interpersonal checking of the same internal phenomenal state.

The thought-experiments considered above suggest that it is logically possible to distinguish:

(a) the subjective interpretation of a phenomenal mental state x

from

(b) the phenomenal mental state x in itself.

This separation in fact seems to be possible. We know cases of hypnosis where people are led to feel pains that do not exist or the case of a patient at the dentist that because of fear believes he feels pain when he in fact feels only the sensation of friction…

Now, if we accept that the separation between (a) and (b) is logically possible, then the interpersonal sharing of mental phenomenal states turns out to be logically possible, which at least in principle makes possible interpersonal checking of identification rules for mental states. In this case, the private language argument fails because the logical unsharability of phenomenal states is a false principle. The rules of phenomenal language acquire an epistemic status similar to the rule I made for myself of never eating creamed spinach again; both could in principle be checked. Consequently, we are entitled to assume that what we only believe to be the rules of our phenomenal language may in fact be the actual rules, since they are at least logically capable of interpersonal correction.

Furthermore, we are also entitled to say that the rules for the identification of phenomenal states are highly probable, since this probability is very well confirmed in an indirect way by a multitude of systematically related associations between interpersonally accessible physical phenomena and reports of internal phenomenal occurrences. For example: if wrinkling the forehead often accompanies the application of the statement ‘I am feeling pain’ when one believes one has a feeling x, wrinkling the forehead indirectly reinforces the probability that the words really refer to the same feeling in their application, if combined with many other already established evidences of pain. Our case is not different from the case of concluding based on a large amount of convincing circumstantial (indirect) evidence that a person was in fact murdered by a psychopath. Even if no one actually saw a murder taking place, a great quantity of circumstantial evidence could be rightly seen by a jury as inductively mutually reinforcing, and taken together as highly convincing (Costa 2011, ch. 5).

Concluding remarks

Returning to our initial question about the nature of the intermediate link, we can now see more clearly why and how the intermediate link between words and things can be read in two different complementary modes, either in the psychological mode, or in the semantic mode in which particular bearers of a link and their psychological singularities are left aside.[19] That is: epistemic meanings are semantic-cognitive rules of application, which can be conceived of in their possible or effective application and that when regarded in terms of their conditions of satisfaction can be called criterial rules. As will be seen later, the epistemic meanings of statements should be nothing but verification rules that apply when the criterial configurations required by them are adequately satisfied, making the statements true.[20]

Nonetheless, it is important to maintain a clear distinction between semantic and psychological, as philosophers like Frege and Husserl insisted. The semantic is conventionally grounded and grammatically necessary; the psychological is spatio-temporally given and physically contingent. But contrary to what these philosophers have supposed, nothing semantic can really exist outside of cognitive instantiations. Semantic entities are nothing more than conventional structures that exist only when embodied in mental acts, even if considered in abstraction from their contingent bearers. To assume that semantic entities can exist without any psychological basis is to hypostasize their nature.[21]

[1] This cognitive versus semantic dichotomy can be traced in history at least as far back as Aristotle, who saw the intermediary link as an affection of soul (ton en têi psychêi pathêmáton) or thought (noêmata) – a psychological perspective – while Stoics, who appealed to ‘what is said’ (lectón) or ‘what is meant’ (semainómenon), associated the intermediary link in some way with language – a semanticist view. (Manetti 1993, p. 93 ff.)

[2] Of course one could also do the same thing without drawing on color memory: suppose that people carry with them templates of vermilion and compare the patches of color they see with these templates when necessary. Indispensable is the existence of some empirically given external model.

[3] Qualitative identity is the identity between different things; it is opposed to numeric identity, which is the identity of a thing with itself.

[4] It is true that this last ‘any’ allows us to infer that there is a class, the class of all tokens that are qualitative identical, but this class does not belong to the definition and does not need to be known by anyone.

[5] As the earlier Wittgenstein wrote: ‘The name means its object. The object is its meaning.’ (1093c sec. 3.203).

[6] The view was ironized by Gilbert Ryle as the ‘Fido-Fido’ theory of meaning. See his article ‘The Theory of Meaning’ (1957).

[7] As Russell recognizes, logical atomism was first suggested by Wittgenstein, who defended it in a full-fledged way in his Tractatus Logico-Philosophicus.

[8] This kind of difficulty already appears clearly in the final public discussion of Russell’s speech in ‘The Philosophy of Logical Atomism’, p. 203. (For criticism see Tugendhat 1976, p. 382 and Kripke 2013, pp. 15-16.)

[9] See my discussion of Wittgenstein’s private language argument later in this chapter.

[10] One could object that since there are many different shades of red (one of them being vermilion), red cannot be simple. But we can answer that what we call ‘simple’ depends on whatever system we have adopted: we can use an old language game with only three basic colors: red, yellow and blue. Here red will be considered simple; and in this case the distinct shades of red will not be taken into account, even if they are perceptually distinguishable. Instead of being precisely similar to the pattern, a new red patch must only be sufficiently similar, there being limits determined relative to the other two colors.

[11] Language not only has a communicational function, but also an organizational function, in the sense that we also use it to think, to organize our ideas and our plans of action (Vygotsky). At first sight, the identification of meaning with use doesn’t seem to do justice to its organizational function. But it doesn’t have to be so. It makes sense to say that when I think that the Eiffel Tower is made of metal, I am using this name referentially in my mind, in thought, in a dialogue with myself.

[12] See (Costa, 1995 ch. 1). The assumption that grounds my reconstruction is that Wittgenstein was not making repeated attempts to explain the nature of meaning, which always ended in some kind of failure, being then replaced by another, as some interpreters seem to believe. What he did was to develop different approximative, often analogical suggestions, each addressing the same issue from a new perspective, such suggestions being largely complementary, each with the other. In this way, it is possible to find continuity in Wittgenstein’s semantic conceptions, which began with the Notebooks 1914-1916 and ended with On Certainty.

[13] For an attempted solution see (Costa 2011, ch. 5).

[14] Wittgenstein, Philosophische Grammatik, sec. 23.

[15] Anthony Kenny also noted the existence of two forms of misusing expressions in his book, Wittgenstein (Cambridge, MA: Harvard University Press, 1973)

[16] C. S. Peirce’s view, according to which all thought is in signs seems to be wrong when we consider that we are effectively able to think without using words. But it is plausible that in having these thoughts we are unconsciously using signs that are, if not linguistic, at least imagetic or emotive.

[17] ‘Private language argument’ isn’t a term chosen by Wittgenstein, and it would be naïve to believe that there is only one possible way of reading his remarks often pointing to incompatible directions. Here I reconstruct the so-called private language argument in a way that makes its results as philosophically strong as it would be reasonably possible to make them, deriving from this argument the destruction of human subjectivity as it is currently understood; a private language argument with trivial conclusions would be of scant interest.

[18] See, for instance, A. J. Ayer 1972, p. 196.

[19] While semantic theories like that of Davidson fall short of the mark, the Gricean psychological theory of meaning misses the mark. What H. P. Grice elucidates with the suggestion that the meaning of the speaker’s utterance p is recognition by its hearer of the speaker’s intention to say p is not the epistemic meaning of utterance p, but only part of the procedure whereby the same meaning is communicated. (See Grice 1991, ch. 5, 6, 14, 18) In Lesson 14 of his Vorlesungen zur Einführung in die sprachanalytische Philosophie, Ernst Tugendhat convincingly criticizes Grice’s attempts to explain the meaning of utterances in this way.

[20] Note that there are non-referential cognitive rules: we can have rules that relate (a) the empirical date of language to cognitions, (b) cognitions to other cognitions, and (c) cognitions to actions. But as to the issue of reference, what matters is the first kind of rule, which is responsible for referential meaning.

[21] There are in my judgment a variety of ways to make these hypostases. One of them is to identify sense/meaning with Platonic entities (Frege, Husserl); another (which will be criticized in due time) is to identify linguistic meaning with essential kinds of external things (Putnam); another is to identify meaning with minimum units of reference (Russell); and yet another is to identify meaning with merely psychological intentions (Grice).

ADVERTISEMENT FOR "PHILOSOPHICAL SEMANTICS": WITTGENSTEIN

Tuesday, July 5, 2016

Claudio Costa, "PHILOSOPHICAL SEMANTICS" - CHAPTER ON WITTGENSTEINS PHILOSOPHY