we can apply filter and let it represent what we want , first filter can be used to filter "polite thing" and so on. we have 2 channels for each kernel size (2,3,4) -> output size is (4,5,6) -> 1 max pooling from each channel and concatenate -> concatenate whole and put it into softmax to tell positve or negative. Regularization Dropout = Create masking vector r of Bernoulli random variable with..