SQL-Server
What is normalization?
It is set of rules that have been established to aid in the design of tables that are meant to be connected
through relationships. This set of rules is known as Normalization. Benefits of normalizing your database
will include:
1. Avoiding repetitive entries
2. Reducing required storage space
3. Preventing the need to restructure existing tables to accommodate new data.
4. Increased speed and flexibility of queries, sorts, and summaries.
What are different types of normalization?
Following are the three normal forms :-
1. First Normal - Form For a table to be in first normal form, data must be broken up into the smallest
units possible.In addition to breaking data up into the smallest meaningful values, tables in first normal
form should not contain repetitions groups of fields.
2. Second Normal form - The second normal form states that each field in a multiple field primary
keytable must be directly related to the entire primary key. Or in other words, each non-key field should
be a fact about all the fields in the primary key.
3. Third normal - form A non-key field should not depend on other Non-key field.
What is denormalization?
Denormalization is the process of putting one fact in numerous places (its vice-versa of
normalization).Only one valid reason exists for denormalizing a relational design - to enhance
performance.The sacrifice to performance is that you increase redundancy in database.
What is a candidate key?
A table may have more than one combination of columns that could uniquely identify the rows in a table;
each combination is a candidate key. During database design you can pick up one of the candidate keys
to be the primary key. For example, in the supplier table supplierid and suppliername can be candidate
key but you will only pick up supplierid as the primary key.
What are the different types of joins? What is the difference between them?
1. Inner join - Inner Join is the default type of join, it will producesses the result set, which contains
matched rows only.
Syntax: select * from table1<innerjoin>table2.
2. Outer Join - Outer join produces the results, which contains matched rows and unmatched rows. Here
we have three types of joins:-
1.Left Outer Join 2.Right Outer Join 3.Full Outer Join.
Left Outer Join: Left Outer Join producesses the results, which contains all the rows from Left table and
matched rows from Right Table. Syntax: select * from table1<leftouterjoin>table2.
Right Outer Join: Right Outer Join producesses the resultset, which contains all the rows from right table
and matched rows from left table. syntax:select * from table1<right outer join>table2.
Full Outer Join: Full Outer Join producesses the resultset, which contains all the rows from left table and
all the rows from right table. syntax:select * from table1<fullouterjoin>table2.
3. Cross Join - A join without having any condition is known as Cross Join, in cross join every row in
first table is joins with every row in second table. syntax: select * from table1<cross join>table2
Self Join: A join joins withitself is called self join working with self joins we use Alias tables
What are indexes? What is the difference between clustered and nonclustered indexes?
Indexes in SQL Server are similar to the indexes in books. They help SQL Server retrieve the data
quickly. There are clustered and nonclustered indexes. A clustered index is a special type of index that
reorders the way in which records in the table are physically stored. Therefore table can have only one
clustered index. The leaf nodes of a clustered index contain the data pages. A nonclustered index is a
special type of index in which the logical order of the index does not match the physical stored order of
the rows on disk. The leaf node of a nonclustered index does not consist of the data pages. Instead, the
leaf nodes contain index rows.
How can you increase SQL performance?
Following are tips which will increase your SQl performance :-
1. Every index increases the time takes to perform INSERTS, UPDATES and DELETES, so the number
of indexes should not be too much. Try to use maximum 4-5 indexes on one table, not more. If you have
read-only table, then the number of indexes may be increased.
2. Keep your indexes as narrow as possible. This reduces the size of the index and reduces the number of
reads required to read the index.
3. Try to create indexes on columns that have integer values rather than character values.
4. If you create a composite (multi-column) index, the order of the columns in the key are very
important. Try to order the columns in the key as to enhance selectivity, with the most selective columns
to the leftmost of the key.
5. If you want to join several tables, try to create surrogate integer keys for this purpose and create
indexes on their columns.
6. Create surrogate integer primary key (identity for example) if your table will not have many insert
operations.
7. Clustered indexes are more preferable than nonclustered, if you need to select by a range of values or
you need to sort results set with GROUP BY or ORDER BY.
8. If your application will be performing the same query over and over on the same table, consider
creating a covering index on the table.
9. You can use the SQL Server Profiler Create Trace Wizard with "Identify Scans of Large Tables" trace
to determine which tables in your database may need indexes. This trace will show which tables are
being scanned by queries instead of using an index.
What is the use of OLAP?
OLAP is useful because it provides fast and interactive access to aggregated data and the ability to drill
down to detail.
What is a measure in OLAP?
Measures are the key performance indicator that you want to evaluate. To determine which of the
numbers in the data might be measures. A rule of thumb is: If a number makes sense when it is
aggregated, then it is a measure.
What are dimensions in OLAP?
Dimensions are the categories of data analysis. For example, in a revenue report by month by sales
region, the two dimensions needed are time and sales region. Typical dimensions include product, time,
and region.
What are levels in dimensions?
Dimensions are arranged in hierarchical levels, with unique positions within each level. For example,
a time dimension may have four levels, such as Year, Quarter, Month, and Day. Or the dimension
might have only three levels, for example, Year, Week, and Day. The values within the levels are called
members. For example, the years 2002 and 2003 are members of the level Year in the Time dimension.
What are fact tables and dimension tables in OLAP?
The dimensions and measures are physically represented by a star schema. Dimension tables revolve
around fact table. A fact table contains a column for each measure as well as a column for each
dimension. Each dimension column has a foreign-key relationship to the related dimension table, and the
dimension columns taken together are the key to the fact table.
What is DTS?
DTS is used to pull data from various sources into the star schema.
What is fillfactor?
The 'fill factor' option specifies how full SQL Server will make each index page. When there is no
free space to insert new row on the index page, SQL Server will create new index page and transfer
some rows from the previous page to the new one. This operation is called page splits. You can reduce
the number of page splits by setting the appropriate fill factor option to reserve free space on each
index page. The fill factor is a value from 1 through 100 that specifies the percentage of the index
page to be left empty. The default value for fill factor is 0. It is treated similarly to a fill factor value of
100, the difference in that SQL Server leaves some space within the upper level of the index tree for
FILLFACTOR = 0. The fill factor percentage is used only at the time when the index is created. If the
table contains read-only data (or data that very rarely changed), you can set the 'fill factor' option to 100.
When the table's data is modified very often, you can decrease the fill factor to 70% or whatever you
think is best.
What is RAID and how does it work?
Redundant Array of Independent Disks (RAID) is a term used to describe the technique of improving
data availability through the use of arrays of disks and various data-striping methodologies. Disk arrays
are groups of disk drives that work together to achieve higher data-transfer and I/O rates than those
provided by single large drives. An array is a set of multiple disk drives plus a specialized controller (an
array controller) that keeps track of how data is distributed across the drives. Data for a particular file
is written in segments to the different drives in the array rather than being written to a single drive. For
speed and reliability, it is better to have more disks. When these disks are arranged in certain patterns
and are use a specific controller, they are called a Redundant Array of Inexpensive Disks (RAID) set.
There are several numbers associated with RAID, but the most common are 1, 5 and 10. RAID 1 works
by duplicating the same writes on two hard drives. Let us assume you have two 20 Gigabyte drives. In
RAID 1, data is written at the same time to both the drives. RAID1 is optimized for fast writes. RAID
5 works by writing parts of data across all drives in the set (it requires at least three drives). If a drive
failed, the entire set would be worthless. To combat this problem, one of the drives stores a "parity" bit.
Think of a math problem, such as 3 + 7 = 10. You can think of the drives as storing one of the numbers,
and the 10 is the parity part. By removing any one of the numbers, you can get it back by referring to
the other two, like this: 3 + X = 10. Of course, losing more than one could be evil. RAID 5 is optimized
for reads. RAID 10 is a bit of a combination of both types. It doesn't store a parity bit, so it is faster, but
it duplicates the data on two drives to be safe. You need at least four drives for RAID 10. This type of
RAID is probably the best compromise for a database server.
What is the difference between DELETE TABLE and TRUNCATE TABLE commands?
Following are difference between them :-
1. DELETE TABLE syntax logs the deletes thus make the delete operation slow. TRUNCATE table
does not log any information but it logs information about deallocation of data page of the table so
TRUNCATE table is faster as compared to delete table.
2. DELETE table can have criteria while TRUNCATE can not.
3. TRUNCATE table can not fire trigger.
What are the problems that can occur if you do not implement locking properly in SQL SERVER?
Following are the problems that occur if you do not implement locking properly in SQL SERVER:-
1. Lost Updates Lost updates occur if you let two transactions modify the same data at the same time,
and the transaction that completes first is lost. You need to watch out for lost updates with the READ
UNCOMMITTED isolation level. This isolation level disregards any type of locks, so two simultaneous
data modifications are not aware of each other. Suppose that a customer has due of 2000$ to be paid. He
pays 1000$ and again buys a product of 500$. Lets say that these two transactions are now been entered
from two different counters of the company. Now both the counter user starts making entry at the same
time 10:00 AM. Actually speaking at 10:01 AM the customer should have 2000$-1000$+500 = 1500$
pending to be paid. But as said in lost updates the first transaction is not considered and the second
transaction overrides it. So the final pending is 2000$+500$ = 2500$.
2. Non-Repeatable Read - Non-repeatable reads occur if a transaction is able to read the same row
multiple times and gets a different value each time. Again, this problem is most likely to occur with the
READ UNCOMMITTED isolation level. Because you let two transactions modify data at the same time,
you can get some unexpected results. For instance, a customer wants to book flight, so the travel agent
checks for the flights availability. Travel agent finds a seat and goes ahead to book the seat.While the
travel agent is booking the seat, some other travel agent books the seat. When this travel agent goes to
update the record, he gets error saying that "Seat is already booked". In short the travel agent gets
different status at different times for the seat. Dirty Reads Dirty reads are a special case of non-
repeatable read. This happens if you run a report while transactions are modifying the data that you're
reporting on. For example there is a customer invoice report which runs on 1:00 AM in afternoon and
after that all invoices are sent to the respective customer for payments. Let us say one of the customer
has 1000$ to be paid. Customer pays 1000$ at 1:00 AM and at the same time report is run. Actually
customer has no money pending but is still issued an invoice. Phantom Reads Phantom reads occur due
to a transaction being able to read a row on the first read, but not being able to modify the same row due
to another transaction deleting rows from the same table. Lets say you edit a record in the mean time
somebody comes and deletes the record, you then go for updating the record which does not exist.
Interestingly, the phantom reads can occur even with the default isolation level supported by SQL
Server: READ COMMITTED. The only isolation level that doesn't allow phantoms is SERIALIZABLE,
which ensures that each transaction is completely isolated from others. In other words, no one can
acquire any type of locks on the affected row while it is being modified.
What are different transaction levels in SQL SERVER?
Transaction Isolation level decides how is one process isolated from other process.Using transaction
levels you can implement locking in SQL SERVER. There are four transaction levels in SQL
SERVER :-
1. READ COMMITTED - The shared lock is held for the duration of the transaction, meaning that no
other transactions can change the data at the same time. Other transactions can insert and modify data in
the same table, however, as long as it is not locked by the first transaction.
2. READ UNCOMMITTED - No shared locks and no exclusive locks are honored. This is the least
restrictive isolation level resulting in the best concurrency but the least data integrity.
3. REPEATABLE READ - This setting disallows dirty and non-repeatable reads. However, even though
the locks are held on read data, new rows can still be inserted in the table, and will subsequently be
interpreted by the transaction.
4. SERIALIZABLE This is the most restrictive setting holding shared locks on the range of data. This
setting does not allow the insertion of new rows in the range that is locked; therefore, no phantoms are
allowed. Following is the syntax for setting transaction level in SQL SERVER. SET TRANSACTION
ISOLATION LEVEL SERIALIZABLE
What are the different locks in SQL SERVER?
Depending on the transaction level six types of lock can be acquired on data :- 314 Intent The intent lock
shows the future intention of SQL Server's lock manager to acquire locks on a specific unit of data for a
particular transaction. SQL Server uses intent locks to queue exclusive locks, thereby ensuring that these
locks will be placed on the data elements in the order the transactions were initiated. Intent locks come in
three flavors: intent shared (IS), intent exclusive (IX), and shared with intent exclusive (SIX). IS locks
indicate that the transaction will read some (but not all) resources in the table or page by placing shared
locks. IX locks indicate that the transaction will modify some (but not all) resources in the table or page
by placing exclusive locks. SIX locks indicates that the transaction will read all resources, and modify
some(but not all) of them. This will be accomplished by placing the shared locks on the resources read
and exclusive locks on the rows modified. Only one SIX lock is allowed per resource at one time;
therefore, SIX locks prevent other connections from modifying any data in the resource (page or table),
although they do allow reading the data in the same resource. Shared Shared locks (S) allow transactions
to read data with SELECT statements. Other connections are allowed to read the data at the same time;
however, no transactions are allowed to modify data until the shared locks are released. Update Update
locks (U) are acquired just prior to modifying the data. If a transaction modifies a row, then the update
lock is escalated to an exclusive lock; otherwise, it is converted to a shared lock. Only one transaction
can acquire update locks to a resource at one time. Using update locks prevents multiple connections
from having a shared lock that want to eventually modify a resource using an exclusive lock. Shared
locks are compatible with other shared locks, but are not compatible with Update locks. Exclusive
Exclusive locks (X) completely lock the resource from any type of access including reads. They are
issued when data is being modified through INSERT, UPDATE and DELETE statements. 315 Schema
Schema modification locks (Sch-M) are acquired when data definition language statements, such as
CREATE TABLE, CREATE INDEX, ALTER TABLE, and so on are being executed. Schema stability
locks (Sch-S) are acquired when store procedures are being compiled. Bulk Update Bulk update locks
(BU) are used when performing a bulk-copy of data into a table with TABLOCK hint. These locks
improve performance while bulk copying data into a table; however, they reduce concurrency by
effectively disabling any other connections to read or modify data in the table.
Can we suggest locking hints to SQL SERVER?
We can give locking hints that helps you over ride default decision made by SQL Server. For instance,
you can specify the ROWLOCK hint with your UPDATE statement to convince SQL Server to lock each
row affected by that data modification. Whether it's prudent to do so is another story; what will happen
if your UPDATE affects 95% of rows in the affected table? If the table contains 1000 rows, then SQL
Server will have to acquire 950 individual locks, which is likely to cost a lot more in terms of memory
than acquiring a single table lock. So think twice before you bombard your code with ROWLOCKS.
What is LOCK escalation?
Lock escalation is the process of converting of low level locks (like row locks, page locks) into higher
level locks (like table locks). Every lock is a memory structure too many locks would mean, more
memory being occupied by locks. To prevent this from happening, SQL Server escalates the many fine-
grain locks to fewer coarse-grain locks. Lock escalation threshold was definable in SQL Server 6.5, but
from SQL Server 7.0 onwards it's dynamically managed by SQL Server.
What are the different ways of moving data/ databases between servers and databases in SQL
Server?
There are lots of option available; you have to choose your option depending upon your requirements.
Some of the options you have are: BACKUP/RESTORE, detaching and attaching databases, replication,
DTS, BCP, logshipping, INSERT...SELECT, SELECT...INTO, creating INSERT scripts to generate
data.
What is the difference between a HAVING CLAUSE and a WHERE CLAUSE?
You can use Having Clause with the GROUP BY function in a query and WHERE Clause is applied to
each row before they are part of the GROUP BY function in a query.
What is the difference between UNION and UNION ALL SQL syntax?
UNION SQL syntax is used to select information from two tables. But it selects only distinct records
from both the table, while UNION ALL selects all records from both the tables.
How can you raise custom errors from stored procedure?
The RAISERROR statement is used to produce an ad hoc error message or to retrieve a custom message
that is stored in the sysmessages table. You can use this statement with the error handling code presented
in the previous section to implement custom error messages in your applications. The syntax of the
statement is shown here. RAISERROR ({msg_id |msg_str }{,severity ,state } [ ,argument [ ,,...n ] ] )
) [ WITH option [ ,,...n ] ] A description of the components of the statement follows. msg_id :-The ID
for an error message, which is stored in the error column in sysmessages. msg_str :-A custom message
that is not contained in sysmessages. severity :- The severity level associated with the error. The valid
values are 0–25. Severity levels 0–18 can be used by any user, but 19–25 are only available to members
of the fixed-server role sysadmin. When levels 19–25 are used, the WITH LOG option is required. state
A value that indicates the invocation state of the error. The valid values are 0–127. This value is not used
by SQL Server. Argument, One or more variables that are used to customize the message. For example,
you could pass the current process ID (@@SPID) so it could be displayed in the message. WITH option,
The three values that can be used with this optional argument are described here.
LOG - Forces the error to log in the SQL Server error log and the NT application log.
NOWAIT - Sends the message immediately to the client.
SETERROR - Sets @@ERROR to the unique ID for the message or 50,000. The number of options
available for the statement makes it seem complicated, but it is actually easy to use. The following
shows how to create an ad hoc message with a severity of 10 and a state of 1. RAISERROR ('An error
occurred updating the NonFatal table',10,1) --Results-- An error occured updating the NonFatal table
The statement does not have to be used in conjunction with any other code, but for our purposes it will
be used with the error handling code presented earlier. The following alters the ps_NonFatal_INSERT
procedure to use RAISERROR.
USE tempdb go
ALTER PROCEDURE ps_NonFatal_INSERT
@Column2 int =NULL
AS
DECLARE @ErrorMsgID int
INSERT NonFatal VALUES (@Column2)
SET @ErrorMsgID=@@ERROR
IF @ErrorMsgID <>0
BEGIN RAISERROR ('An error occured updating the NonFataltable',10,1)
END
When an error-producing call is made to the procedure, the custom message is passed to the client.
What is ACID fundamental? What are transactions in SQL SERVER?
A transaction is a sequence of operations performed as a single logical unit of work. A logical unit of
work must exhibit four properties, called the ACID (Atomicity, Consistency, Isolation, and Durability)
properties, to qualify as a transaction: Atomicity
1. A transaction must be an atomic unit of work; either all of its data modifications are performed or
none of them is performed. Consistency
2. When completed, a transaction must leave all data in a consistent state. In a relational database, all
rules must be applied to the transaction's modifications to maintain all data integrity. All internal data
structures, such as B-tree indexes or doubly-linked lists, must be correct at the end of the transaction. 320
Isolation
3. Modifications made by concurrent transactions must be isolated from the modifications made by any
other concurrent transactions. A transaction either see data in the state it was before another concurrent
transaction modified it, or it sees the data after the second transaction has completed, but it does not see
an intermediate state. This is referred to as serializability because it results in the ability to reload the
starting data and replay a series of transactions to end up with the data in the same state it was in after the
original transactions were performed. Durability
4. After a transaction has completed, its effects are permanently in place in the system. The modifications
persist even in the event of a system failure.
What is the purpose of Replication?
Replication is way of keeping data synchronized in multiple replication has two important aspects
publisher and subscriber.Publisher Database server that makes data available for replication is called as
Publisher. Subscriber Database Servers that get data from the publishers is called as Subscribers.
What are the different types of replication supported by SQL SERVER?
There are three types of replication supported by SQL SERVER:-
1. Snapshot Replication - Snapshot Replication takes snapshot of one database and moves it to the other
database. After initial load data can be refreshed periodically. The only disadvantage of this type of
replication is that all data has to be copied each time the table is refreshed.
2. Transactional Replication - In transactional replication data is copied first time as in snapshot
replication, but later only the transactions are synchronized rather than replicating the whole
database.You can either specify to run continuously or on periodic basis.
3. Merge Replication - Merge replication combines data from multiple sources into a single central
database.Again as usual the initial load is like snapshot but later it allows change of data both on
subscriber and publisher, later when they come on-line it detects and combines them and updates
accordingly.
What are the different types of triggers in SQl SERVER 2000?
There are two types of triggers :-
1. INSTEAD OF triggers - INSTEAD OF triggers fire in place of the triggering action. For example,
if an INSTEAD OF UPDATE trigger exists on the Sales table and an UPDATE statement is executed
against the Salestable, the UPDATE statement will not change a row in the sales table. Instead, the
UPDATE statement causes the INSTEAD OF UPDATE trigger to be executed, which may or may not
modify data in the Sales table. 326
2. AFTER triggers - AFTER triggers execute following the SQL action, such as an insert, update,
or delete. This is the traditional trigger which existed in SQL SERVER. INSTEAD OF triggers gets
executed automatically before the Primary Key and the Foreign Key constraints are checked, whereas
the traditional AFTER triggers gets executed after these constraints are checked. Unlike AFTER triggers,
INSTEAD OF triggers can be created on views.
If we have multiple AFTER Triggers on table how can we define the sequence of the triggers?
If a table has multiple AFTER triggers, then you can specify which trigger should be executed first and
which trigger should be executed last using the stored procedure sp_settriggerorder. All the other triggers
are in an undefined order which you cannot control.
What is SQl injection?
It is a Form of attack on a database-driven Web site in which the attacker executes unauthorized SQL
commands by taking advantage of insecure code on a system connected to the Internet, bypassing the
firewall. SQL injection attacks are used to steal information from a database from which the data would
normally not be available and/or to gain access to an organization’s host computers through the computer
that is hosting the database. SQL injection attacks typically are easy to avoid by ensuring that a system
has strong input validation. As name suggest we inject SQL which can be relatively dangerous for the
database. Example this is a simple SQL
SELECT email,passwd, login_id, full_name FROM members WHERE email = 'x'.
Now somebody does not put "x" as the input but puts "x ; DROP TABLE members;". So the actual SQL
which will execute is :-
SELECT email, passwd, login_id, full_name FROM members WHERE email = 'x' ; DROP TABLE
members;
What is the difference between Stored Procedure (SP) and User Defined Function (UDF)?
Following are some major differences between a stored procedure and user defined functions:-
1. UDF can be executed using the "SELECT" clause while SP’s can not be.
2. UDF can not be used in XML FOR clause but SP’s can be used.
3. UDF does not return output parameters while SP’s return output parameters.
4. If there is an error in UDF its stops executing. But in SP’s it just ignores the error and moves to the
next statement.
5. UDF can not make permanent changes to server environments while SP’s can change some of the
server environment.
What is Recursive Queries and Common Table Expressions(Sql Server 2005)?
Common Table Expressions (CTE) evolved from SQL Server's derived tables. CTEs are refered to in the
FROM clause of a query, just like a derived table or view, but unlike a view, CTEs are non-persistent.
CTEs share the life of the full query to which they belong. The big improvement here is support for
recursion on, in effect, derived tables. With a derived table, you cannot create it once and use it several
times in a query, and it cannot refer to itself in a statement. CTEs overcome this. It's an excellent feature
with major potential for report writing. This is one of the many low-level feature additions in the BI push
in SQL Server 2005.
What are PIVOT and UNPIVIT Operators(Sql Server 2005)?
PIVOT and UNPIVOT are relational operators that, like CTEs, are geared toward BI applications. To
put it briefly, PIVOT provides the basic operation required for crosstabs, effectively converting columns
into rows. UNPIVOT does the reverse, converting rows into columns. This allows you to perform quick
crosstabs without having to use OLAP. This is a great feature for report building!
What are APPLY, CROSS APPLY and OUTER APPLY Operators(Sql Server 2005)?
APPLY is a relational operator that allows you to use a table-valued function once for each row of outer
table expressions. You use APPLY in the FROM clause much like a JOIN, and it comes in two forms -
CROSS APPLY and OUTER APPLY. CROSS APPLY invokes a table-valued function for each row in
an outer table expression. OUTER APPLY is similar but it will return rows from the outer table where
the table-valued function returned an empty set. This is a reasonably useful feature, but not life changing
for most SQL programmers.
How to handle Exceptions in Transactions(Sql Server 2005)?
In an effort to render the GOTO statements of previous TSQL implementations useless, SQL Server
2005 implements TRY/CATCH blocks. These blocks work as expected, just like blocks of the same
type in C# or VB.NET. This is a good feature for multipart transactions and will definitely demonstrate
increased value for developers. The new features in TSQL offer a lot of benefit for SQL programmers
and definitely warrant poking around with the server. Next up is the expanded XML support provided by
SQL Server 2005. There is now an XML data type with methods built to operate on that data as well as
support for XQuery and DML Extensions.
How Sql support for XML Data Type(Sql Server 2005)?
Standard SQL supports only simple scalar datatypes. XML, on the other hand, is capable of modeling
any type of data. The XML datatype allows you to store XML in the database as a supported data type,
as opposed to simply storing it as a string. With the XML data type, you can query, validate, and modify
the contents of an XML document in the CML data type. It "integrates traditional, relational data with
data in unstructured or semi-structured XML documents in ways that are not possible with SQL Server
2000." I have to take issue with this because SQL is about data storage and retrieval. SQL is not about
data processing, especially not XML processing. I'll talk of this more later in the article.
What are SQL Management Objects (SMO)(Sql Server 2005)?
The SMO model is the management object model for SQL Server 2005. It is based on .NET Framework
managed code. It's the primary tool for creating database management applications using the .NET
Framework, and is used by every dialog box in the SQL Management Studio. SMO replaces SQL-DMO
but, where possible, implements similar objects for ease of use. SMO has a much larger array of features,
however, adding over 150 classes. Microsoft says the big advantages of using SMO are its performance
and scalability. It uses a cached object model, meaning you can change multiple object properties
before a call to the server is made. This is a nice management feature that will definitely be useful for
developers of custom database tools.
What are the new Application Framework - Notifications, Reporting and Mobile Services
I chose to sum these up in one section because the objectives are all similar - to provide enhanced BI
and reporting services. The big feature add for notifications and messaging is the SQL Service Broker,
built specifically to provide a scalable architecture for building asynchronous message routing. It allows
an internal or external process to send and receive streams of reliable, asynchronous messages by using
the extensions to normal TSQL data manipulation. The new reporting services are based on the recently
acquired ActiveViews technology to provide developers with a complete set of tools for creating,
managing, and viewing reports. They also provide an engine for hosting and processing reports and an
open interface based architecture for embedding reports. The new mobile services allow use of objects
similar to the core ADO.NET objects for CE environments, as well as use of DTS and parameterized
queries. All together, these make a nice foundation for building business applications - but once again,
they don't necessarily belong in the database.
CLR Integration in Sql Server 2005
The last and most important change to SQL Server is the integration of the .NET CLR into the database
server query environment. According to Microsoft, "Using common language runtime (CLR) integration,
you can code your stored procedures, functions, and triggers in the .NET Framework language of your
choice." I suppose this is great for .NET programmers, but what about the rest of the world using SQL
Server with Java, PHP, COBOL, ADA, etc? How about this line: "Many tasks that were awkward or
difficult to perform in Transact-SQL can be better accomplished by using managed code…" The idea
here is that developers can perform complex processing on their data before the next application tier ever
sees it, but I must ask - what's the point? Those tasks are not database tasks! SQL is for data management
and retrieval, not output formatting, mathematical computations, graphics or list processing. Why keep
piling on the features? Why dilute SQL with extension upon extension until you have this big mess that
does nothing well? Please humor me while I digress.
No comments:
Post a Comment